FIELD OF THE INVENTION

The invention relates to genes encoding enzymes which
are fundamental in the biosynthesis of pimaricin. The
invention further relates to the application of said genes for
modifying the biosynthesis of pimaricin, as well as for the
biosynthesis of new compounds.

BACKGROUND OF THE INVENTION

Polyketides, such as pimaricin (in the literature
also referred to as natamycin; see Fig. 3A for its structure),
form a large and highly diverse group of natural
products. Members of said group include compounds having
antibacterial, antifungal, anticancer, antiparasitic and
immunosuppressant activities. Despite their structural
diversity, these metabolites are believed to be synthesized
by micro-organisms by a common pathway in which units derived
from acetate, propionate or butyrate are condensed onto the
growing chain by a polyketide synthase (PKS), in a process
resembling fatty acid biosynthesis, except that the β-keto
function introduced at each elongation step may undergo all,
part or none of a reductive cycle comprising β-ketoreduction,
dehydration and enoyl reduction. The structural variety in the
group of polyketides arises from the choice of monomers, the
extent of β-ketoreduction and dehydration, and the
stereochemistry of each chiral center. Yet further diversity
is produced by functionalization of the polyketide chain by
the action of glycosylases, methyltransferases and oxidative
enzymes.
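
The optional reductive cycle described above can be illustrated with a short sketch. It is illustrative only and not part of the disclosure: the function name `beta_carbon_state` and the labels KR (ketoreduction), DH (dehydration) and ER (enoyl reduction) are chosen here for convenience, and the sketch assumes that each step can only act on the product of the preceding one.

```python
def beta_carbon_state(activities):
    """Return the group left at the beta-carbon after one elongation step.

    `activities` is the set of reductive steps applied, using the
    illustrative labels "KR", "DH" and "ER". A step is skipped when the
    step it depends on did not take place.
    """
    state = "keto"                    # beta-keto function formed on condensation
    if "KR" in activities:
        state = "hydroxyl"            # beta-ketoreduction
        if "DH" in activities:
            state = "double bond"     # dehydration
            if "ER" in activities:
                state = "methylene"   # enoyl reduction: full cycle, as in fatty acids
    return state


# "None, part or all" of the reductive cycle:
for applied in (set(), {"KR"}, {"KR", "DH"}, {"KR", "DH", "ER"}):
    print(sorted(applied), "->", beta_carbon_state(applied))
```

Combining these four outcomes over many elongation steps, together with the choice of monomer and the stereochemistry, accounts for the structural variety mentioned in the text.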

Modification of complex biomolecules, such as
polyketides, is an increasingly important way of obtaining
biologically active compounds with improved or altered
properties. Currently, these modifications are usually
introduced by chemical methods in a directed or random (e.g.
in combinatorial chemistry) manner. A drawback of these
chemical methods is that they are often performed under
relatively harsh conditions. Furthermore, they lack
selectivity and/or sensitivity. Particularly in the case of
complex biomolecules having multiple functionalized, reactive
groups, precautions have to be taken in order to avoid
undesired side reactions. These precautions include for
instance the introduction of protective groups before a
desired chemical conversion, involving two additional process
steps, as the protective groups must be removed afterwards.

Bioconversion of simple organic compounds, i.e.
compounds with no or a single reactive center, has been known
for some time and is widely applied. Examples are the
oxidation of long chain alkanes using alkane hydroxylation
systems of Pseudomonas, and epoxidation of alkenes using
enzyme systems from various micro-organisms. For the specific
modifications required in the biosynthesis of complex
molecules, like β-lactam antibiotics, polyketide antibiotics,
anticancer agents, or peptide antibiotics, the large number
of reactive groups present in these molecules is problematic
for even the simplest treatments, like hydrolysis of specific
bonds. More interesting treatments frequently destroy the
molecule completely.



SUMMARY OF THE INVENTION

It is an object of the present invention to provide
the means to perform specific conversions in complex
biomolecules, in particular in polyketides, without applying
the harsh conditions often related to chemical modifications.
It is further an object of the invention that said
conversions can be carried out as part of a biosynthesis of
said biomolecules, for instance in micro-organisms.

The invention provides said means in the
form of three specific genes encoding enzymes which
facilitate specific oxidative conversions in the biosynthesis
of pimaricin. Thus, the invention concerns a nucleotide
sequence comprising SEQ ID NO. 5 (ORF1), a nucleotide
sequence comprising SEQ ID NO. 7 (ORF2), a nucleotide
sequence comprising SEQ ID NO. 9 (ORF3), and homologues or
fragments thereof.

In the context of the present invention, a homologue
gene sequence is defined as a sequence that is at least
75%, preferably at least 85%, more preferably at least 90%
identical, while still encoding proteins having the desired
enzymatic activity. Fragments of genes are, in accordance
with the invention, defined as parts of the sequence having
the indicated SEQ ID NO. which have at least 90 nucleotides
and encode proteins having the desired enzymatic activity.

A homologue protein sequence is defined herein as a
sequence that is at least 80%, preferably at least 90%,
more preferably at least 95% identical, while still having
the desired enzymatic activity. Fragments of proteins are, in
accordance with the invention, defined as parts of the
protein sequence having the indicated SEQ ID NO. which have
at least 30 amino acids and the desired enzymatic
activity.
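
The numerical part of these definitions can be expressed as a minimal sketch. The function names are hypothetical, and the way identity is counted (matching positions divided by the length of a pre-computed alignment, gaps included) is an assumption, since the text does not specify an alignment method. The requirement of retained enzymatic activity cannot be checked computationally and is left out.

```python
# Minimum thresholds taken from the definitions in the text.
MIN_IDENTITY = {"gene": 75.0, "protein": 80.0}        # percent identity
MIN_FRAGMENT_LENGTH = {"gene": 90, "protein": 30}     # nucleotides / amino acids


def percent_identity(aligned_a, aligned_b):
    """Percent identity of two pre-aligned sequences ('-' marks a gap).

    Assumption: identity is counted over all alignment columns.
    """
    if len(aligned_a) != len(aligned_b) or not aligned_a:
        raise ValueError("sequences must be aligned to the same non-zero length")
    matches = sum(1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-")
    return 100.0 * matches / len(aligned_a)


def meets_homologue_threshold(kind, identity):
    """True if `identity` reaches the broadest threshold for a 'gene' or 'protein'."""
    return identity >= MIN_IDENTITY[kind]


def meets_fragment_length(kind, length):
    """True if a fragment is long enough to fall under the definition."""
    return length >= MIN_FRAGMENT_LENGTH[kind]


identity = percent_identity("ATGGCC-TAA", "ATGGCCGTAA")
print(identity, meets_homologue_threshold("gene", identity))
```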

Surprisingly, it has been found that the present
genes can be expressed in different micro-organisms, which
may be used in the biosynthesis of different biomolecules. It
has further been found that said genes may be switched off
(or knocked out) in the Streptomyces which is usually used
for the biosynthesis of pimaricin. In this embodiment, said
micro-organism produces not pimaricin but a modified
biomolecule. In addition, it has been found that said genes
may be overexpressed in Streptomyces, leading to an increased
production of pimaricin by said micro-organism.

DESCRIPTION OF THE DRAWINGS

Figure 1: Physical map of part of the Pimaricin
biosynthetic cluster.

Genes: locations of the genes encoding
polyketide synthases and oxidative genes involved in
Pimaricin production (not drawn to scale);

Probes: 0.7 indicates the location of the 0.7
kb fragment used to identify the extent of polyketide
synthase encoding regions; 3.3 indicates the location of the
3.3 kb fragment used in polyketide synthase gene disruption;

Cosmids: sizes and numbers of available
cosmids covering the chromosomal region encompassing the
oxidative genes.

Figure 2: Detailed physical map of the chromosomal
regions including the oxidative genes.

Figure 3A: Molecular structure of Pimaricin.

Figure 3B: Molecular structures of Pimaricin
derivatives with a reduced oxidation state of C4 and C5
and/or the carboxyl group at C12.

Figure 4: Molecular structures of Amphotericin B and
Nystatin.

DETAILED DESCRIPTION OF THE INVENTION

The functionality of the Pimaricin PKS associated
genes was initially pursued by comparing their derived amino
acid sequences with those present in databases like
EMBL, Genbank, NBRF/PIR, or Swissprot.

Surprisingly, ORF1 appeared to resemble cholesterol
oxidases from several Streptomyces species. The close
association of ORF1 with the Pimaricin PKS suggests an
oxidative step in Pimaricin tailoring. A cholesterol oxidase
encoding gene has not been observed previously in a
polyketide biosynthesis gene cluster.

Based on similar analyses, ORF2 and ORF3 resemble
cytochrome P450 dependent monooxygenases from various
sources. With respect to the biosynthesis of bioactive
compounds, P450 dependent monooxygenases have been identified
before in association with polyketide gene clusters, e.g. in
the Erythromycin and Rapamycin biosynthesis gene clusters.
Only in the Erythromycin case has the specific enzymatic
action on Erythromycin precursor compounds been proven.
Essentially all known cases of tailoring oxidation steps act
on secondary carbon atoms (methylene groups). Oxidation of
primary carbon atoms (methyl groups) in polyketide
biosynthesis, as has presently been found, is unprecedented.
No knowledge is available on the molecular basis of epoxide
formation in polyketide products, though epoxides are present
in a few known structures.

As has been mentioned above, each of the genes ORF1,
ORF2 and ORF3, separately or in combination, can be used for
various purposes, as will be discussed in detail below.

Targeted inactivation of one or more of the present
genes, e.g. through marker insertion or replacement with a
non-functional gene equivalent, will interfere with at least
one (oxidation) step in the Pimaricin biosynthetic route, and
thus result in modified Pimaricin molecules characterized by
a different oxidative state. For example, molecules can be
created lacking the epoxide function at carbons C4 and C5, or
molecules with a modified oxidation state of the carboxyl
group at C12 resulting in an aldehyde, alcohol, or methyl
group at this position.

Disruption of chromosomally encoded genes can be
accomplished by gene replacement strategies: gene replacement
is preferably carried out using suicide plasmid vectors or
defective phage vectors carrying modified target genes and
detection or selection marker genes. The various elements
useful for such strategies, and how to employ them, are
described below.

Target gene modification can be accomplished by
disruption of the coding sequence by insertion or deletion of
nucleotides or nucleotide stretches. Such insertions or
deletions may be of different size. Preferably, they are of a
size of at least 2 nucleotides, excepting multiples of 3.
Alternatively, the coding region of the target gene may be
replaced by that of a (marker) gene conferring an easily
detectable phenotype to cells transformed with this
construct. Suitable examples of such replacement genes are
lacZ, xylE, Green Fluorescent Protein, and genes for the
biosynthesis of antibiotics, such as erythromycin, apramycin,
hygromycin, and thiostrepton, and metabolite analogues, such
as fluoroacetamide.
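
The size preference stated above (at least 2 nucleotides, excepting multiples of 3) amounts to requiring a shift of the reading frame. The following sketch, with hypothetical helper names and a made-up toy sequence, shows the criterion and its effect on downstream codons.

```python
def is_preferred_indel(size):
    """True if an insertion or deletion of `size` nucleotides is at least
    2 nucleotides long and not a multiple of 3, so that it shifts the frame."""
    return size >= 2 and size % 3 != 0


def codons(seq):
    """Split a coding sequence into complete codons, dropping any remainder."""
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


# Toy coding sequence (illustrative only, not taken from the sequence listing).
original = "ATGGCCGATTAA"
disrupted = original[:3] + "GG" + original[3:]   # 2-nucleotide insertion after the start codon

print(codons(original))    # reading frame as designed
print(codons(disrupted))   # every codon after the insertion is changed
print([n for n in range(1, 13) if is_preferred_indel(n)])
```

An insertion or deletion whose size is a multiple of 3 only adds or removes whole codons and may leave a partly functional protein, which is presumably why such sizes are excluded.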

Transfer of the disrupted target gene to the
Pimaricin production host, resulting in in vivo gene
inactivation, can be accomplished by using e.g. suicide vector
systems or a defective phage containing a fragment internal
to the coding region of the target gene, or a variant of the
gene inactivated through deletion or insertion of DNA
stretches as described above, and in addition a detection or
selection marker. Suicide vectors and defective phages are
characterized by their inability to propagate autonomously in
the strain to be transformed and thus cannot by themselves
be stably maintained. For Streptomycetes in general several
suicide systems are available: suicide vectors can be chosen
from the group of extrachromosomal element based cloning
vectors available for E. coli, which cannot replicate in
Streptomyces species, e.g. pBR322, pUC, ColD, RSF1010,
RK2 and vectors derived from these plasmids. Similarly,
Streptomyces plasmids characterized by a limited host range
can be selected that are incapable of stable maintenance in
the desired host strain. Examples of such narrow host range
plasmids are SLP1.2 and SCP2, and cloning vectors derived
from these plasmids. Still another possibility is to use
temperature sensitive variants of Streptomyces wide host
range plasmids. These plasmids are characterized by their
inability to replicate above a certain (restrictive)
temperature. Besides non-replicative plasmids, defective
phage vectors have also been developed based on the
Streptomyces phage phiC31 and have proven extremely useful
for genetic analysis. In this regard, it is noted that an
extensive overview of known Streptomyces genetic engineering
techniques may be found in Hopwood et al. (D.A. Hopwood,
M.J. Bibb, K.F. Chater, T. Kieser, C.J. Bruton, H.M. Kieser,
D.J. Lydiate, C.P. Smith, J.M. Ward, H. Schrempf, Genetic
Manipulation of Streptomyces: A Laboratory Manual, The John
Innes Foundation, Norwich, England, 1985).

The above mentioned suicide constructs can be
introduced in a desired host cell using transformation
procedures with isolated DNA, by conjugation from a donor
microorganism, e.g. an E. coli or Streptomyces strain
harboring the construct, or via transfection by phage
particles. All of these methods are well within the knowledge
of the person skilled in the art.

Upon introduction of such a construct in the
microorganism of interest, e.g. Streptomyces natalensis,
stable maintenance of the introduced genetic information is
only possible by integration of the construct in the host
chromosome, preferably by homologous recombination with the
chromosomal copy of the target gene. Strains having
integrated the construct in the chromosome can be detected by
the expression of the co-introduced marker: in case of a
detection marker, transformed colonies can be screened for
acquired properties like conversion of a colorless substrate
into a colored compound (applicable with e.g. the genes lacZ
or xylE) or fluorescence (by expression of e.g. Green
Fluorescent Protein). Alternatively, a marker can be used
which allows selection of transformed strains by acquired
resistance to e.g. antibiotics or toxic metabolite analogues;
the latter method is employed more frequently because
only cells with the acquired resistance will be able to grow
in media containing the antibiotic or toxic metabolite
analogue. If an internal fragment of the target gene is used
for the construction of the suicide vector or defective
phage, integration of the construct into the chromosomal copy
of the target gene will result in inactivation immediately.
If the suicide construct or defective phage contains the
complete target gene or a fragment including the N-terminal
or C-terminal coding region, though inactivated through
smaller insertions or deletions, integration of the
construct alone will result in the presence of both an active
and an inactive copy of the gene, separated by vector DNA. For
obtaining a strain with only an inactive copy, a second
homologous recombination has to take place, removing the
vector sequences and the active copy of the target gene.
Strains having undergone this second homologous recombination
can be detected by the loss of the acquired property encoded
by the co-introduced marker gene.
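
The two-step gene replacement described above can be followed with a small bookkeeping sketch. All names (`Strain`, `first_crossover`, `second_crossover`, `classify`) are hypothetical and the model is deliberately simplified: it only tracks the marker and the two gene copies. It also makes explicit that the second recombination may leave either the disrupted or the intact copy behind, which is why marker-free strains still have to be checked, e.g. by Southern hybridization.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Strain:
    marker: bool          # co-introduced detection/selection marker present
    active_copy: bool     # intact target gene present
    inactive_copy: bool   # disrupted target gene present


WILD_TYPE = Strain(marker=False, active_copy=True, inactive_copy=False)


def first_crossover(strain):
    """Integration of the whole construct: marker gained, both copies present."""
    if strain.marker:
        raise ValueError("construct is already integrated")
    return Strain(marker=True, active_copy=True, inactive_copy=True)


def second_crossover(strain, keeps_disrupted_copy):
    """Excision of the vector sequences: marker lost, one gene copy remains."""
    if not strain.marker:
        raise ValueError("second crossover requires an integrated construct")
    return Strain(marker=False,
                  active_copy=not keeps_disrupted_copy,
                  inactive_copy=keeps_disrupted_copy)


def classify(strain):
    """Describe a strain by the phenotype/genotype used for screening."""
    if strain.marker:
        return "integrant (marker-positive)"
    if strain.inactive_copy and not strain.active_copy:
        return "gene replacement mutant"
    return "wild type or revertant"


integrant = first_crossover(WILD_TYPE)
for outcome in (True, False):
    print(classify(integrant), "->", classify(second_crossover(integrant, outcome)))
```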

Another application of the present genes from the
Pimaricin gene cluster lies in overexpression of one or more
of these genes in the natural host (Streptomyces natalensis).
The expression of the individual genes within the cluster is
tightly regulated by the cell physiology and/or cluster
specific regulatory genes. This internal control may be
appropriate for production of the antibiotic in the natural
environment, but undesirable for industrial production.
Overexpression of all genes of the cluster by introduction of
additional gene copies or altering the controlling elements
(e.g. promoters or regulatory genes) can boost antibiotic
production considerably, as has been shown for, among others,
Actinorhodin production by Streptomyces coelicolor. A similar
effect can be obtained by overexpression of specifically
those genes encoding enzymes representing rate-limiting
steps of the antibiotic biosynthesis.

Additional copies of each of the present genes from
the Pimaricin biosynthesis gene cluster, either separately or
in different combinations, can be introduced into
Streptomyces natalensis and increase the efficiency of the
oxidative reactions leading to the natural Pimaricin
molecule, thus resulting in strains displaying an improved
Pimaricin production, expressed either in an increased
Pimaricin titre in the culture broth or a higher product
yield on substrate consumed. Of course, enhanced expression
of certain genes can also be combined with inactivation of
other genes, thus effecting improved production of variants
of Pimaricin as described above.

Strains containing additional copies of target genes
can be obtained through introduction of complete genes
including expression signals (promoters) on the production
host chromosome by techniques employing suicide vectors or
defective phage as described above. Alternatively,
autonomously replicating DNA molecules derived from phage
genomes or extrachromosomal elements like plasmids can be
used to carry the additional genes. Over the past two decades
many naturally occurring Streptomyces plasmids have been
turned into efficient cloning vectors, the most commonly used
vectors being derived from plasmids pIJ101 and SCP2. Other
vectors can be constructed based on the plasmid naturally
occurring in Streptomyces natalensis, as disclosed in GB
patent application nr 2210619 (Antibioticos), using selection
and/or detection markers similar to those employed for the
pIJ101 derived vectors, such as pIJ702 and pIJ486, with or
without added markers as described above.

For gene expression a large variety of promoters
efficiently directing transcription of genes in Streptomyces
is available; an example of a constitutive promoter is the
ermE promoter, directing expression of the erythromycin
resistance gene from Saccharopolyspora erythraea, whereas the
agarase gene promoter from S.coelicolor, the promoter of the
glycerol utilization operon, or the tipA promoter are
examples of promoters inducible by specific substrates. Using
techniques known in the art additional promoters can be
obtained, e.g. promoters endogenous to S.natalensis (see
J.M.Ward, G.R.Janssen, T.Kieser, M.J.Bibb, M.J.Buttner,
M.J.Bibb. 1986. Mol. Gen. Genet. 203: 468-478).

The degree of overexpression can be manipulated by
the choice of the promoter or the amount of inducing
compound, and in addition by the choice of the autonomously
replicating vector system; depending on the vector derivative
used, predetermined plasmid copy numbers can range from 1 or
2 to 500. It is well within the expertise of the
person skilled in the art to adjust the vector system to the
desired degree of overexpression.

Both of the above uses of the present genes from the
Pimaricin biosynthetic gene cluster, i.e. inactivation to
obtain new variants of Pimaricin and overexpression to
increase Pimaricin productivity, can also be applied to
strains producing structurally similar bioactive compounds
such as Amphotericin B (Streptomyces nodosus), Nystatin
(Streptomyces noursei) (see Figure 4) etc., to obtain variants
of these compounds or improve productivity, or both. Using
the present genes to inactivate the corresponding genes in
other Streptomyces species will result in new derivatives
of, inter alia, nystatin and amphotericin B altered in their
oxidative state.

Still another application of the present genes is the
heterologous expression and exploitation of the enzymatic
activity encoded by one or more of these genes. Using similar
vector systems as employed for overexpression of the
oxidative genes in S.natalensis, other microorganisms,
preferably Streptomycetes species, such as the easily
accessible strain Streptomyces lividans, can be genetically
transformed and thus acquire new oxidative enzymatic
activity. This route is particularly useful for application
of the enzymatic activities to the oxidative modification of
other bioactive compounds, such as secondary metabolites,
antibiotics, anticancer agents etc., which often are highly
functionalized chemical entities. Thus, it is possible to
introduce one or more of the present genes in hosts producing
such bioactive compounds naturally or having acquired the
genetic information to produce compounds by recombinant DNA
technology. Strains having acquired the genes encoding the
oxidative enzymatic activity from the Pimaricin biosynthetic
gene cluster will then be able to introduce epoxide functions
or alcohol, aldehyde, or carboxyl groups into metabolites
previously not modified in such a way. Thus, an approach has
been provided which allows for the creation of new variants
of bioactive compounds not obtainable by chemical means
(exemplified in Example 6 below).

The invention will now be elucidated by the
following, non-restrictive examples.

EXAMPLES

Example 1. Isolation and identification of pimaricin
biosynthetic genes.

Streptomyces natalensis strain ATCC27448 was grown in
YEME medium (D.A. Hopwood, M.J. Bibb, K.F. Chater, T. Kieser,
C.J. Bruton, H.M. Kieser, D.J. Lydiate, C.P. Smith, J.M.
Ward, H. Schrempf, Genetic Manipulation of Streptomyces: A
Laboratory Manual, The John Innes Foundation, Norwich,
England, 1985) at 30 °C for 3 days. Mycelium was harvested and
total DNA was extracted and purified essentially as described
by Hopwood (ibid.).

Total S.natalensis DNA was subjected to partial
digestion with restriction enzyme Sau3AI and size
fractionated on 0.8% agarose gel. Fragments of 30-40 kbp were
isolated, inserted into BamHI digested cosmid Supercos1 and
subsequently introduced in E.coli strain XL1-Blue MR
according to protocols suggested by the supplier (Stratagene,
La Jolla).
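
How many cosmid clones such a library needs can be estimated with the standard Clarke-Carbon relation N = ln(1 - P) / ln(1 - f), where f is the insert size divided by the genome size and P the desired probability that a given sequence is represented. The sketch below is an illustration only: the 30-40 kbp insert range comes from the text, whereas the genome size of 8 Mbp and the target probability of 0.99 are assumptions made here, as the text gives neither.

```python
import math


def clones_needed(insert_bp, genome_bp, probability):
    """Clarke-Carbon estimate of the number of independent clones required
    for a given sequence to be present with the stated probability."""
    if not 0 < probability < 1:
        raise ValueError("probability must lie strictly between 0 and 1")
    if not 0 < insert_bp < genome_bp:
        raise ValueError("insert must be smaller than the genome")
    fraction = insert_bp / genome_bp
    return math.ceil(math.log(1 - probability) / math.log(1 - fraction))


ASSUMED_GENOME_BP = 8_000_000   # assumption: typical order of magnitude for Streptomyces
ASSUMED_PROBABILITY = 0.99      # assumption: not stated in the text

for insert_bp in (30_000, 35_000, 40_000):
    n = clones_needed(insert_bp, ASSUMED_GENOME_BP, ASSUMED_PROBABILITY)
    print(f"{insert_bp} bp inserts: about {n} clones")
```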

Thus a cosmid library of S.natalensis DNA in E.coli
was obtained. This cosmid library was then screened for the
presence of polyketide synthase (PKS) related sequences by
hybridization with radioactively labeled fragments from known
PKS genes from the Rapamycin biosynthesis cluster from
Streptomyces hygroscopicus (T.Schwecke, J.F.Aparicio,
I.Molnár, A.König, L.E.Khaw, S.F.Haydock, M.Oliynyk,
P.Caffrey, J.Cortés, J.B.Lester, G.A.Böhm, J.Staunton,
P.F.Leadlay. 1995. Proc. Natl. Acad. Sci. USA 92: 7839-7843).

Several clones were isolated as containing sequences
hybridizing to a fragment containing the KS module 5 of
rapB.

Complete DNA sequence determination of a number of
neighbouring NotI fragments from Cos9 was performed after
cloning the fragments in pBluescript. Computer assisted
analysis of the DNA sequences revealed the presence of genes
clearly identifiable as PKS gene modules by nucleotide and
derived amino acid sequence homology with established PKS
genes and proteins, involved in the biosynthesis of
erythromycin and rapamycin, as well as with fatty acid
synthase genes and proteins, which catalyze a similar set of
reactions. The complete nucleotide sequences and derived
amino acid sequences of two Pimaricin PKS genes are given as
SEQ ID numbers 1-4. Using a 0.7 kb NotI fragment from Cos9 as
a probe, the extent of the PKS related genes on the cosmid
map was established as indicated in Figure 1.

Example 2. PKS genes are essential for Pimaricin
biosynthesis

A completely sequenced 3.3 kb NotI DNA fragment (see
Figure 1) (in pBluescript), encoding (part of) a S.natalensis
PKS as deduced from the organizational and structural
sequence similarities with known PKS, was excised by SacI
from the sequencing vector, subcloned in the phage vector
KC515 (M.R.Rodicio, C.J.Bruton, K.F.Chater. 1985. Gene 34:
283-292) and introduced in S.lividans to obtain infectious
particles (recombinant phage) containing the S.natalensis PKS
fragment. Infection of S.natalensis using this recombinant
phage population and selection for resistance to the
antibiotic viomycin allowed the isolation of lysogens,
originated through integration of the recombinant phage DNA
into the S.natalensis chromosomal DNA by homologous
recombination of the PKS regions.

None of 20 lysogens tested displayed antifungal
activity as analyzed by an agar plate bioassay using Candida
utilis as the indicator organism. Detailed analysis of one of
the lysogens by Southern hybridization studies confirmed that
integration of the recombinant phage DNA into the
S.natalensis chromosomal PKS locus had indeed occurred.

Culturing the lysogen with the disrupted PKS gene in
standard production medium (25 g/l soya peptone, 0.5 mM
ZnSO4, 20 g/l glucose, pH 7.5) followed by extraction of the
culture broth with butanol, and UV spectrophotometric
analysis indicated that no traces of Pimaricin were produced
by this lysogen (J.F.Martin, A.L.Demain. 1975. Biochem.
Biophys. Res. Commun. 71: 1103-1109).

Example 3. Detailed sequence analysis of non-PKS
genes: preliminary identification.

Full sequence analysis of the regions flanking the
PKS genes of Example 1 revealed the presence of additional
open reading frames (ORFs) potentially encoding proteins
functional in Pimaricin biosynthesis.

Homology comparison of the deduced amino acid
sequences of the ORFs indicated the involvement of several in
oxidation/reduction reactions: ORF1 showed a clear homology
with previously identified cholesterol oxidases, while ORF2
and ORF3 were similar to cytochrome P-450 monooxygenase
proteins. Genes encoding accessory proteins for the P-450
enzymes, i.e. of the ferredoxin type, also seem to be
present. Complete nucleotide
sequences of the respective genes and derived amino acid
sequences are added as SEQ ID numbers 5-10.



Example 4. Functional characterization of Pimaricin
biosynthesis non-PKS genes

To define the involvement of the accessory
genes/proteins in Pimaricin biosynthesis, the function of each
of the major ORFs, i.e. ORF1, ORF2, and ORF3, was disrupted
and the effect on Pimaricin production established. Similar
strategies as described in Example 2 for the PKS disruption
were employed for the non-PKS genes. Detailed information on
the chromosomal regions encompassing the three open reading
frames (ORFs) is presented in Figure 2.

ORF1: a 7 kb SphI fragment containing the complete
ORF1 was cloned into pUC19; the resulting plasmid was
digested with BglII, the cohesive ends were filled in by
treatment with Klenow polymerase and religated. This new
plasmid was used as a source of DNA for the gene
replacement. The 2.9 kb BamHI-PstI fragment from the plasmid
was cloned into the BamHI-PstI sites of KC515. Lysogens were
obtained by selection for thiostrepton resistance. The second
recombination event was searched for by the loss of
thiostrepton resistance. The insertion and subsequent loss of
the phage was confirmed by Southern hybridization.

ORF2: a 1.6 kb SalI fragment encompassing most of
ORF2 was inserted in pUC19; the resulting plasmid was cut
with restriction enzyme BstBI and treated with T4 DNA
polymerase to disrupt the presumed coding sequence, followed
by religation. Again a gene replacement strategy was followed,
in which the intact, chromosomal copy of ORF2 was
exchanged for the disrupted variant. Thus the BamHI-PstI
fragment of the plasmid was transferred to the phage vector
KC515, via which lysogens were obtained as described above,
having the recombinant phage DNA inserted in the
corresponding S.natalensis chromosomal locus. For disruption
of the gene a second homologous recombination event was
selected for by loss of thiostrepton resistance. Thiostrepton
sensitive strains derived from the ORF2 lysogens were
analyzed for the presence of the intact or disrupted copy by
Southern hybridization.

Alternatively, a 1.95 kb BamHI fragment can be used
after cloning in pUC19/BamHI, and transfer, after
modification of the BstBI restriction site, of the BamHI
fragment to phage vector KC515.

ORF3: direct disruption was accomplished by insertion
of a 987 bp PvuII fragment internal to ORF3 into the phage
vector KC515 and transformation of S.lividans to obtain
infectious recombinant phage particles. After transfection of
S.natalensis, lysogens were isolated as described above. In a
different approach gene disruption is realized by insertion
of a 3.4 kb SalI fragment encompassing the complete ORF3 in
SalI digested pUC19, followed by digestion of the resulting
plasmid with MluI, T4 DNA polymerase treatment, and
religation. Subsequently, the BamHI-PstI fragment of the
plasmid was transferred to the phage vector KC515, via which
lysogens were obtained as described above, having the
recombinant phage DNA inserted in the corresponding
S.natalensis chromosomal locus. For disruption of the gene a
second homologous recombination event was selected for by
loss of thiostrepton resistance. Thiostrepton sensitive
strains derived from the ORF3 lysogens were analyzed for the
presence of the intact or disrupted copy by Southern
hybridization.
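
The cut, fill-in and religation steps used above for BglII, BstBI and MluI can be checked with a short sketch. The recognition sequences and cut positions are the standard ones for these enzymes; the function name is hypothetical, and the sketch assumes complete fill-in of the 5' overhang and blunt-end religation without loss of nucleotides.

```python
# Recognition site (5'->3') and top-strand cut position for each enzyme.
ENZYMES = {
    "BglII": ("AGATCT", 1),   # A^GATCT
    "BstBI": ("TTCGAA", 2),   # TT^CGAA
    "MluI": ("ACGCGT", 1),    # A^CGCGT
}


def fill_in_and_religate(site, cut):
    """Sequence present after a palindromic site with a 5' overhang is cut,
    the overhang is filled in, and the blunt ends are rejoined."""
    if cut * 2 >= len(site):
        raise ValueError("this sketch only handles 5'-overhang cutters")
    left_end = site[:len(site) - cut]   # left arm, extended across the overhang
    right_end = site[cut:]              # right arm, extended across the overhang
    return left_end + right_end


for name, (site, cut) in ENZYMES.items():
    product = fill_in_and_religate(site, cut)
    gained = len(product) - len(site)
    print(f"{name}: {site} -> {product} (+{gained} bp, frameshift: {gained % 3 != 0})")
```

Under these assumptions each treatment duplicates the overhang, adding 4 bp (BglII, MluI) or 2 bp (BstBI). Neither is a multiple of 3, so the reading frame downstream of the site is shifted.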



Example 5. Analysis of ORF1, ORF2 and ORF3 gene
disruptants of S.natalensis.

Each of the separate gene disruptants was analyzed
using the bioassay with C.utilis. None displayed an
antifungal activity as distinct as the wild-type strain
S.natalensis ATCC27448.

All disruptions combined in one strain also did not
produce in vivo antifungal activity. In the latter strain,
upon growth in pimaricin production medium (see Example 2), an
analysis of total metabolites from the culture broth
indicated the presence of a Pimaricin precursor, wherein C4,
C5 and the carbon atom of the carboxyl group at C12 are
unoxidized carbon atoms (see Figure 3B). This was
determined by NMR spectroscopy after extraction of the whole
broth with methanol.



Example 6. Overexpression of ORF1, ORF2 and ORF3 in
S.natalensis.



Overexpression of ORF1, ORF2 and ORF3 separately was
obtained by placing each gene under the direction of the ermE
promoter from Saccharopolyspora erythraea (M.J. Bibb, G.R.
Janssen, J.M. Ward. 1985. Gene 38: 215-226). A useful
derivative of this promoter, having a number of cloning sites
attached, was obtained by PCR using the following
oligonucleotides:

SEQ ID 11:
AAACTGCAGCTCTAGAGGCGGCTTGCGCCCGATGCTAGTC
SEQ ID 12:
AAACTGCAGCTCTAGATGCCCGGGTATCGATCGTCGACGGCATGCGGATCCTACCAACCGG
CACGATTG
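
The cloning sites carried by the two oligonucleotides can be located with a simple scan, sketched below. The recognition sequences are the standard ones for the named enzymes; the helper name is hypothetical. The two positions printed as "Q" in the source copy of SEQ ID 12 are read here as G, which is an assumption, although it is the reading that yields intact ClaI and BamHI sites.

```python
# Standard recognition sequences of common cloning enzymes.
SITES = {
    "PstI": "CTGCAG",
    "XbaI": "TCTAGA",
    "SmaI": "CCCGGG",
    "ClaI": "ATCGAT",
    "SalI": "GTCGAC",
    "SphI": "GCATGC",
    "BamHI": "GGATCC",
}

SEQ_ID_11 = "AAACTGCAGCTCTAGAGGCGGCTTGCGCCCGATGCTAGTC"
# Assumption: the two ambiguous "Q" characters in the source are read as G.
SEQ_ID_12 = ("AAACTGCAGCTCTAGATGCCCGGGTATCGATCGTCGACGGCATGCGGATCC"
             "TACCAACCGGCACGATTG")


def find_sites(seq):
    """Return (0-based position, enzyme) for every recognition site in seq,
    sorted by position."""
    hits = []
    for name, site in SITES.items():
        start = seq.find(site)
        while start != -1:
            hits.append((start, name))
            start = seq.find(site, start + 1)
    return sorted(hits)


for label, seq in (("SEQ ID 11", SEQ_ID_11), ("SEQ ID 12", SEQ_ID_12)):
    print(label, [name for _, name in find_sites(seq)])
```

Both oligonucleotides start with PstI and XbaI sites, consistent with the later excision of the promoter-ORF cassettes as PstI or XbaI fragments.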



The 225 bp PCR fragment obtained was digested with
PstI, purified by agarose gel electrophoresis and inserted in
PstI digested pUC18, yielding pUCermE.



ORF1 was inserted in pUCermE as a 2.2 kb SphI-ClaI
fragment encompassing the complete coding sequence; for ORF2
the 3.5 kb ClaI-NruI fragment was used, and for ORF3 the 2.8
kb SalI-KpnI fragment. Each ermE promoter-ORF combination was
subsequently excised as a PstI fragment, inserted in PstI
digested phage vector KC515 and introduced in S.natalensis
essentially as described in Example 4.

Recombinant S.natalensis strains thus obtained,
overexpressing each of the three genes separately, showed
Pimaricin production levels improved by 10-15% after growth
under standard production conditions (see Example 2).

Example 7. Functional expression of S.natalensis 
nPFl. - and ORF3 in S.lividans. 

The ermE promoter-ORF combinations were excised from
the corresponding pUCermE construct as an XbaI fragment,
inserted in XbaI digested plasmid pIJ486 (J.M. Ward,
G.R. Janssen, T. Kieser, M.J. Bibb, M.J. Buttner, M.J. Bibb. 1986.
Mol. Gen. Genet. 203: 468-478) and introduced by transformation
into S. lividans. Enzymatic assays confirmed the activity of
the different ORFs as oxidizing enzymes. Surprisingly, in the
presence of artificial oligoketides synthesized by assembled
PKS modules from the erythromycin pathway, oxidizing activity
on the exo-methyl groups of these oligoketides was observed,
similar to the oxidations observed in the pimaricin
polyketide.
SEQUENCE LISTING
<110> DSM N.V. 

<120> Genes encoding enzymes in the biosynthesis of pimaricin and
applications thereof

<130> EP-2959 P 

<140> 
<141> 

<160> 12 

<170> PatentIn Ver. 2.1

<210> 1 
<211> 5544 
<212> DNA 

<213> Streptomyces natalensis

<220> 

<221> CDS 

<222> (1)..(5544) 

<400> 1 

atg gtg ccc gtc cac aca gat gac tac gcg atc cag ccg ccc gcc gac 48
Met Val Pro Val His Thr Asp Asp Tyr Ala Ile Gln Pro Pro Ala Asp
1 5 10 15

acc gcg cat gga ggg ggt ggc ttc aca ctg cct gct gtg ttc gag gcc 96
Thr Ala His Gly Gly Gly Gly Phe Thr Leu Pro Ala Val Phe Glu Ala
20 25 30

gcc gtg gaa tcc gcc ccc gac gca gtg gcc ctc gtc gac ggc acc gtt 144
Ala Val Glu Ser Ala Pro Asp Ala Val Ala Leu Val Asp Gly Thr Val
35 40 45

cct gga cct ggg cgc atg tgg cgc gca gac gtc gat gcg ctg gcc cgc 192
Pro Gly Pro Gly Arg Met Trp Arg Ala Asp Val Asp Ala Leu Ala Arg
50 55 60

ggg ttg cag gag teg ggt ate gca ccc ggt gac gtg gtg gcg qtq caq 240 
Gly Leu Gln Glu Ser Gly Ile Ala Pro Gly Asp Val Val Ala Val Arg
65 70 75 80

ctg ccg aac tgc ggg agg ttt ccc acc ctg cac ctg gcc gtc gcg gcc 288
Leu Pro Asn Cys Gly Arg Phe Pro Thr Leu His Leu Ala Val Ala Ala
85 90 95

gtc gga gcg gta ctg ctg ccc atc cac cag ggc acc ccg ctc ccg gag 336
Val Gly Ala Val Leu Leu Pro Ile His Gln Gly Thr Pro Leu Pro Glu
100 105 110

gtc gac gcg ctg ctg acc cgg gcg gaa ccc gcc ctt ctc gtc ctg tcg 384
Val Asp Ala Leu Leu Thr Arg Ala Glu Pro Ala Leu Leu Val Leu Ser
115 120 125



f?^ S?^ Srcg acg gcc cgt teg ctt ctg gag age 432 

Ala Ala Gly Ser Asp Gly Leu Ala Thr Ala Arg Ser Leu Leu Glu Ser 
130 135 140 

gtg ccg tcg ctg cgc ggc gtg ctg ctg gcc ggg gcg tcg gga gac ggg 480
Val Pro Ser Leu Arg Gly Val Leu Leu Ala Gly Ala Ser Gly Asp Gly
145 150 155 160
gag tcc gga tcc gtc gga ggt ggg gag tca gga tcc ggg cgt cgg tcg 528
Glu Ser Gly Ser Val Gly Gly Gly Glu Ser Gly Ser Gly Arg Arg Ser
165 170 175

ctg gac ggg ctg ctg gcc ggc tgg gcg ggc agc ggg ccc cgg ccg gtg 576
Leu Asp Gly Leu Leu Ala Gly Trp Ala Gly Ser Gly Pro Arg Pro Val
180 185 190

gac gtc acc ccc gat atg ccg ctc gtt ctc gtc ccc tcg tcc ggg acg 624
Asp Val Thr Pro Asp Met Pro Leu Val Leu Val Pro Ser Ser Gly Thr
195 200 205

gtc tcg gca cgg ccc aaa ctg tgt gtg cac agc cac gac ggt ctg ctg 672
Val Ser Ala Arg Pro Lys Leu Cys Val His Ser His Asp Gly Leu Leu
210 215 220

tcg aac acc gca gcc gtc acc gcc gag gcc gcc gac gcc ttc gac ggt 720
Ser Asn Thr Ala Ala Val Thr Ala Glu Ala Ala Asp Ala Phe Asp Gly
225 230 235 240

ccg gtc ctc acg gcc tgc ccg atg acc cat ctg ttc ggc ctc cag tcc 768
Pro Val Leu Thr Ala Cys Pro Met Thr His Leu Phe Gly Leu Gln Ser
245 250 255

ctg cac gcg gcg ctg ttc gcc gcc tgc acg cag gtc ctg ctc acc ggc 816
Leu His Ala Ala Leu Phe Ala Ala Cys Thr Gln Val Leu Leu Thr Gly
260 265 270

tgg gac gtg gac cgc ttc ctg gaa cag gct cgg gag cac ggc ccc cgt 864
Trp Asp Val Asp Arg Phe Leu Glu Gln Ala Arg Glu His Gly Pro Arg
275 280 285

gtc gtg ttc gcc gtg ccc gcc caa ctg cgg gac gtc gtg aca cga ctg 912
Val Val Phe Ala Val Pro Ala Gln Leu Arg Asp Val Val Thr Arg Leu
290 295 300

gcc agg acc ggc gaa ccg gcg ggc ttc acg ccg tac cag gtg cgt acc 960
Ala Arg Thr Gly Glu Pro Ala Gly Phe Thr Pro Tyr Gln Val Arg Thr
305 310 315 320

gcg ggg gcc gcc gtc gcg ccc gcg ctc gcc gta cgg gta cgc gcc gtc 1008
Ala Gly Ala Ala Val Ala Pro Ala Leu Ala Val Arg Val Arg Ala Val
325 330 335

ctc gac tgc gaa ctg gtc gtg gtg tgg ggg atg tcc gag atc ggc acc 1056
Leu Asp Cys Glu Leu Val Val Val Trp Gly Met Ser Glu Ile Gly Thr
340 345 350



ggc ace cgc ace egg gee cac cac ccc gac ggc tgt gtg gga gag ccg 1104 
Gly Thr Arg Thr Arg Ala His His Pro Asp Gly Cys Val Gly Glu Pro 
355 360 365 

gtc age ggg gtg gac gtg cgc gtc gtc gac gag cac ggc caa gag tge 1152 
Val Ser Gly Val Asp Val Arg Val Val Asp Glu His Gly Gin Glu Cys 
370 375 380 

geg geg gac gag agg ggt gaa etc cag tac egg ggg eca ggg ttg ttc 1200 
Ala Ala Asp Glu Arg Gly Glu Leu Gin Tyr Arg Gly Pro Gly Leu Phe 
385 390 395 400 

cgc ggc tat ttc cgc gaa ccg gag ctg aeg cgc teg gee etc ace gac 1248 
Arg Gly Tyr Phe Arg Glu Pro Glu Leu Thr Arg Ser Ala Leu Thr Asp 
405 410 415 
gac ggc tgg ctg cgg acc ggc gat ctc gcg acc gtc gac gcg gac ggc 1296
Asp Gly Trp Leu Arg Thr Gly Asp Leu Ala Thr Val Asp Ala Asp Gly
420 425 430

gtg gtg gtg ctg cac ggc cgg gcg gcc gag ctg atc aat acg ggg ggc 1344
Val Val Val Leu His Gly Arg Ala Ala Glu Leu Ile Asn Thr Gly Gly
435 440 445 

egg aag ttc age gcc ggc gag gte gag gga ctg etc tec ggc ttt acg 1392 
Arg Lys Phe Ser Ala Gly Glu Val Glu Gly Leu Leu Ser Gly Phe Thr 
450 455 460 

gac ctg ggc ccg ctg gcc gtc gtc ggc gca ccg gac gac cgg ctc ggg 1440
Asp Leu Gly Pro Leu Ala Val Val Gly Ala Pro Asp Asp Arg Leu Gly 
465 470 475 480 

gag tac ecg tge ctg gte gtg ace gac cac gcg gac ggc ace ate ggc 1488 
Glu Tyr Pro Cys Leu Val Val Thr Asp His Ala Asp Gly Thr He Gly 
485 490 495 

ctg age gag gtg acc gcg ttc ctg cge egg ctg ggg etc gee gac cac 1536 
Leu Ser Glu Val Thr Ala Phe Leu Arg Arg Leu Gly Leu Ala Asp His 
500 505 510 

aag ate cce etc gaa ctg gte acc gtg cge gag ett cec ttc tec cct 1584 
Lys He Pro Leu Glu Leu Val Thr Val Arg Glu Leu Pro Phe Ser Pro 
515 520 525 



gcc ggg aag ctc gac cgc ggg gcg ctc aag cgg ctg ctc gcc aac ctt 1632
Ala Gly Lys Leu Asp Arg Gly Ala Leu Lys Arg Leu Leu Ala Asn Leu
530 535 540



gcg gag gtg tec gtt ecg gcc cgt etc ggc gcg gte ecg cce tac acg 1680 

Ala Glu Val Ser Val Pro Ala Arg Leu Gly Ala Val Pro Pro Tyr Thr 
545 550 555 560 

gcc gag gag gcg ctg gac etc gta cge gac tge gte ggc egg gtg etc 1728 

Ala Glu Glu Ala Leu Asp Leu Val Arg Asp Cys Val Gly Arg Val Leu 
565 570 575 

cgt tac ggc gge gcg gcc gte cce ttc cce ecg gac aag gac ttc ttc 1776 

Arg Tyr Gly Gly Ala Ala Val Pro Phe Pro Pro Asp Lys Asp Phe Phe 
580 585 590 

tec ceg gac aag gac ttc cge cag etc ggc ctg gac tec ate ggc gcg 1824 

Ser Pro Asp Lys Asp Phe Arg Gin Leu Gly Leu Asp Ser He Gly Ala 

595 600 605 



gtg cgg ctg cgg aat ctg ctg cgg gag gag acc ggg ctg ccg ctc ccg 1872
Val Arg Leu Arg Asn Leu Leu Arg Glu Glu Thr Gly Leu Pro Leu Pro
610 615 620

gcc acc ctg gcc ttc gac tcc ccc act ccg cga gcc gtc gcg cgc gtc 1920
Ala Thr Leu Ala Phe Asp Ser Pro Thr Pro Arg Ala Val Ala Arg Val
625 630 635 640



ct c T cfcc g a g c a g g a n n a g e r a tcr r a g gar g;^g rr.g agg gag a a n r^nr j 1 Qff ff 
Leu Ala Glu Gin Glu Glu Pro Ser Gin Asp Glu Pro Arg Glu Asn Pro 
645 650 655 



gcg gac ggt gcc gac ccg gtg gcg atc gtg ggc atg gcc tgc cgg ctg 2016
Ala Asp Gly Ala Asp Pro Val Ala Ile Val Gly Met Ala Cys Arg Leu
660 665 670

ccg ggc gga gcc gac tcc ccc gac gcc ctg tgg gag ctg ctc gcc gac 2064
Pro Gly Gly Ala Asp Ser Pro Asp Ala Leu Trp Glu Leu Leu Ala Asp
675 680 685

ggg acc gac gcg atg tcc ccc ttc ccc acg gac cgc ggc tgg gac ctg 2112
Gly Thr Asp Ala Met Ser Pro Phe Pro Thr Asp Arg Gly Trp Asp Leu
690 695 700

gac cgg ctg ttc gac gag gat gcc gac cgc ccg ggt acc tcg tac gcc 2160
Asp Arg Leu Phe Asp Glu Asp Ala Asp Arg Pro Gly Thr Ser Tyr Ala
705 710 715 720

cgc gaa ggc ggc ttc ctg cac gac gcg ggc gac ttc gac gcg ggc ttc 2208
Arg Glu Gly Gly Phe Leu His Asp Ala Gly Asp Phe Asp Ala Gly Phe
725 730 735

ttc ggc ctg tcg gac cag gag gcg acg gcg acc gat ccg cag cag cgg 2256
Phe Gly Leu Ser Asp Gln Glu Ala Thr Ala Thr Asp Pro Gln Gln Arg
740 745 750

ctg ctt ctg gag gcg gcc tgg gag acc ttc gag cgg gcg ggc atc gac 2304
Leu Leu Leu Glu Ala Ala Trp Glu Thr Phe Glu Arg Ala Gly Ile Asp
755 760 765

ccg cag tcc ctg agg gga agc cgt acg ggc gtg ttc acg ggc gcg atg 2352
Pro Gln Ser Leu Arg Gly Ser Arg Thr Gly Val Phe Thr Gly Ala Met
770 775 780



gac cgc ggc tat gga acc agc gcg tcc gcc gcg ccc agc gca tgg gag 2400
Asp Arg Gly Tyr Gly Thr Ser Ala Ser Ala Ala Pro Ser Ala Trp Glu
785 790 795 800

agc atg ctc atc acc ggg acc gcc ggc agc gcg gtc tcg ggg cgc atc 2448
Ser Met Leu Ile Thr Gly Thr Ala Gly Ser Ala Val Ser Gly Arg Ile
805 810 815

gcc tac acc tac ggg ctc gaa ggc ccc gcg ctg acg gtc gac acc gcc 2496
Ala Tyr Thr Tyr Gly Leu Glu Gly Pro Ala Leu Thr Val Asp Thr Ala
820 825 830

tcc tcg tcc tcc ctc gtc gcc ctg cat ctg gcc tgc cgg tcg ctg cgc 2544
Ser Ser Ser Ser Leu Val Ala Leu His Leu Ala Cys Arg Ser Leu Arg
835 840 845

tcg ggc gag acc gac ctg gcg ctg gcc ggc ggc gtc acc gtc atg gcg 2592
Ser Gly Glu Thr Asp Leu Ala Leu Ala Gly Gly Val Thr Val Met Ala
850 855 860

acc ccg gcg ccc ttc gca cac ttc tcc cgg ctg cgc gcg ctg tcc ccc 2640
Thr Pro Ala Pro Phe Ala His Phe Ser Arg Leu Arg Ala Leu Ser Pro
865 870 875 880

gac tcc cgc tcc atg gcg tac gcg gac gcc gcg aac ggc tcg gcg tgg 2688
Asp Ser Arg Ser Met Ala Tyr Ala Asp Ala Ala Asn Gly Ser Ala Trp
885 890 895

tcg gag ggc gcg ggg ctg ctt ctg ctg gag cgg ctg agc gac gcc cgg 2736
Ser Glu Gly Ala Gly Leu Leu Leu Leu Glu Arg Leu Ser Asp Ala Arg
900 905 910

cgc aac gga cac cgt gtc ctg gcg ctc gta cgg ggc tcc gcc gtg aat 2784
Arg Asn Gly His Arg Val Leu Ala Leu Val Arg Gly Ser Ala Val Asn
915 920 925

cag gac ggc gcc tcc aac ggg ctc acc gcg ccg agc gga ccc gca cag 2832
Gln Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Ser Gly Pro Ala Gln
930 935 940

l^"" fr^^ ^^"^ S?^ ff"" ""^^ Sr^^ Srcc ggg ctg acg ccg cag 2880 

Gln Arg Val Ile Arg Gln Ala Leu Ala Asp Ala Gly Leu Thr Pro Gln
945 950 955 960

gac gtg gac gcc gtg gag ggg cac ggc acc ggc acg ccg etc ggc gac 2928 
Asp Val Asp Ala Val Glu Gly His Gly Thr Gly Thr Pro Leu Gly Asp 
965 970 975 

ccc ate gag gcg cag gcg ctg etg gee acg tac ggc caa cag egg cct 2976 
Pro He Glu Ala Gin Ala Leu Leu Ala Thr Tyr Gly Gin Gin Arg Pro 
980 985 990 

gtg gaa egg ccg ttg tgg ctg ggg teg gtg aag teg aac ttc ggg cac 3024 
Val Glu Arg Pro Leu Trp Leu Gly Ser Val Lys Ser Asn Phe Gly His 
995 1000 1005 

aca caa gcc gcc gcc ggg gtc gtc ggc gtc ate aag acg gtg etc gcg 3072 
Thr Gin Ala Ala Ala Gly Val Val Gly Val He Lys Thr Val Leu Ala 
1010 1015 1020 

ctg cgc cac ggc gtc ctg ccg cag acg ctg cac gtg gac get ccc teg 3120 
Leu Arg His Gly Val Leu Pro Gln Thr Leu His Val Asp Ala Pro Ser
1025 1030 1035 1040 

gee aag gtg gac tgg tec gcc ggt teg gta egg etg etg acc gag gcg 3168 
Ala Lys Val Asp Trp Ser Ala Gly Ser Val Arg Leu Leu Thr Glu Ala 
1045 1050 1055 

egg ccc tgg cca egg gag age gga cgt acg cgc egg gcg ggg gtg tec 3216 
Arg Pro Trp Pro Arg Glu Ser Gly Arg Thr Arg Arg Ala Gly Val Ser 
1060 1065 1070 



tcg ttc ggg ctc acc ggc acc aac gcg cac gtg atc ctg gag gag gcg 3264
Ser Phe Gly Leu Thr Gly Thr Asn Ala His Val Ile Leu Glu Glu Ala
1075 1080 1085

ccg gga gag gcg gcg gca ggg gcg egg gee gag gtt ccc gag gag gcg 3312 
Pro Gly Glu Ala Ala Ala Gly Ala Arg Ala Glu Val Pro Glu Glu Ala 
1090 1095 1100 

cgg tgc gcc tcc tca ccg gct cga ctc ccg gag ccg ccc ggc gac gcg 3360
Arg Cys Ala Ser Ser Pro Ala Arg Leu Pro Glu Pro Pro Gly Asp Ala
1105 1110 1115 1120

gcc gcg ccc tgg gtg ctg tee gee egg age egg gcg gcg etg cgc gcg 3408 
Ala Ala Pro Trp Val Leu Ser Ala Arg Ser Arg Ala Ala Leu Arg Ala 
1125 1130 1135 

cag gcg etc cgc ctg gee gac cag gtg gee gee gac ccc ggt eta egg 3456 
Gin Ala Leu Arg Leu Ala Asp Gin Val Ala Ala Asp Pro Gly Leu Arg 
1140 1145 1150 

gcc cag gat gtc gcc cat gcc ctg gcc acc tcc cgt acc ctg cac cgg 3504
Ala Gln Asp Val Ala His Ala Leu Ala Thr Ser Arg Thr Leu His Arg
1155 1160 1165



cac cgg gcc gtc gtc agc ggc tcc gac cgg gca caa atg ctc gcc gcg 3552
His Arg Ala Val Val Ser Gly Ser Asp Arg Ala Gln Met Leu Ala Ala
1170 1175 1180

gca aag cgg ttc ggg ctc ggt gag cgg acc gcg ggc gtc acc ccg gac 3600
Ala Lys Arg Phe Gly Leu Gly Glu Arg Thr Ala Gly Val Thr Pro Asp
1185 1190 1195 1200
gat tcc gcg ccg ggc ctg ctg gcc ttc gtc ttc tcc ggg cag ggc agc 3648
Asp Ser Ala Pro Gly Leu Leu Ala Phe Val Phe Ser Gly Gln Gly Ser
1205 1210 1215

cag cgc agc ggc atg ggg cgc gcg gcg gcc gag gcg ttc ccg gtc ttc 3696
Gln Arg Ser Gly Met Gly Arg Ala Ala Ala Glu Ala Phe Pro Val Phe
1220 1225 1230



gga egg gcg ctg ggc gag gtg tgc gcc gcg ctg gac ccg ctg ctg aca 3744 
Gly Arg Ala Leu Gly Glu Val Cys Ala Ala Leu Asp Pro Leu Leu Thr 
1235 1240 1245 

cgc cca ctg acc teg gtg atg tgg gcg get cec ggc tec gag gag gcg 3792 
Arg Pro Leu Thr Ser Val Met Trp Ala Ala Pro Gly Ser Glu Glu Ala 
1250 1255 1260 

gcc cgt etc gac gac ace ace tae aeg eag ecc gee ctg ttc gee gtc 3840 
Ala Arg Leu Asp Asp Thr Thr Tyr Thr Gin Pro Ala Leu Phe Ala Val 
1265 1270 1275 1280 



cag gtc gcc ctg tac cgg ctg ttc gag tcc tgg ggc gtg gtg ccg gac 3888
Gln Val Ala Leu Tyr Arg Leu Phe Glu Ser Trp Gly Val Val Pro Asp
1285 1290 1295

cag ctg gtg ggg cat tcg gtc ggc gag atc tcc gcc gcc cat gtg gca 3936
Gln Leu Val Gly His Ser Val Gly Glu Ile Ser Ala Ala His Val Ala
1300 1305 1310

ggc gtg ctc ggc ctc cgg gac gcg tgc acc ctg gtg gcg gcc cgt agc 3984
Gly Val Leu Gly Leu Arg Asp Ala Cys Thr Leu Val Ala Ala Arg Ser
1315 1320 1325



agg ctg atg ggc gcg ctg ccg ccc ggc ggt gcg atg gtg gcg gta cgc 4032 
Arg Leu Met Gly Ala Leu Pro Pro Gly Gly Ala Met Val Ala Val Arg 
1330 1335 1340 

ate aeg gaa ccc gaa gtg acc cca tgg etc gcg gag ttg aeg gae gag 4080 
lie Thr Glu Pro Glu Val Thr Pro Trp Leu Ala Glu Leu Thr Asp Glu 
1345 1350 1355 1360 

gtg teg ate gcg gcc gtc aac ggt ccg cac tec etc gtg etc gcg gge 4128 
Val Ser He Ala Ala Val Asn Gly Pro His Ser Leu Val Leu Ala Gly 
1365 1370 1375 



gcc gag gcc ccg ctc gtc gcc ctc acg gac cgg ctc gcc gcc gcc gga 4176
Ala Glu Ala Pro Leu Val Ala Leu Thr Asp Arg Leu Ala Ala Ala Gly
1380 1385 1390

cac aag acc cgg cgc ctc atg gtg agc acc gcg ccc cac tcg ccg ctg 4224
His Lys Thr Arg Arg Leu Met Val Ser Thr Ala Pro His Ser Pro Leu
1395 1400 1405



atg gae ecc atg ctg gag gag ttc cgc gcg gtc gtc cgc aeg ctg tee 4272 
Met Asp Pro Met Leu Glu Glu Phe Arg Ala Val Val Arg Thr Leu Ser 

1410 1415 1420 

tac gcc gcg ccc gcc gtt ccc etc gtc tec acc gtc acc ggc cgc ccg 4320 
Tyr Ala Ala Pro Ala Val Pro Leu Val Ser Thr Val Thr Gly Arg Pro 
1425 1430 1435 1440 

ctg acc ggc gag gag gcg cgc gac ccg gac cac tgg gtg egg cat gtg 4368 
Leu Thr Gly Glu Glu Ala Arg Asp Pro Asp His Trp Val Arg His Val 
1445 1450 1455 
cgg cag tcc gtc cgc ttc aag gac gcg atc ggc cgg ctc cgg gac gaa 4416
Arg Gln Ser Val Arg Phe Lys Asp Ala Ile Gly Arg Leu Arg Asp Glu
1460 1465 1470 

cgc gte ace ggg ttc etg gag etg ggt gee gaa ecg gca etc aca ccg 4464 
Arg Val Thr Gly Phe Leu Glu Leu Gly Ala Glu Pro Ala Leu Thr Pro 
1475 1480 1485 

atg ate gac gag tgc etg gag tee gee gae ggg eag cec ggg ace gee 4512 
Met He Asp Glu Cys Leu Glu Ser Ala Asp Gly Gin Pro Gly Thr Ala 
1490 1495 1500 

etg gtg ccg agt etg cgc gee gga gtg ccg gag egg gat gcc etg etc 4560 
Leu Val Pro Ser Leu Arg Ala Gly Val Pro Glu Arg Asp Ala Leu Leu 
1505 1510 1515 1520 

ace gcg gtc gcc egg gtg cac gcc cag ggc gtt ccc gtc gac tgg gae 4608 
Thr Ala Val Ala Arg Val His Ala Gin Gly Val Pro Val Asp Trp Asp 
1525 1530 1535 

gcg gtg etc ccc ggg gcc gag get tec gtc acc gtg cgc gga etg ccc 4656 
Ala Val Leu Pro Gly Ala Glu Ala Ser Val Thr Val Arg Gly Leu Pro 
1540 1545 1550 

gcc gee gac cgc cag tgg ttc cgc ttc gtc ccc gae cag ggc gcg ecg 4704 
Ala Ala Asp Arg Gin Trp Phe Arg Phe Val Pro Asp Gin Gly Ala Pro 
1555 1560 1565 

etc acc etc gcc gac cgc teg etg cac etg gag ggc gcc gcc cac etc 4752 
Leu Thr Leu Ala Asp Arg Ser Leu His Leu Glu Gly Ala Ala His Leu 
1570 1575 1580 

cgc gae gtg ggc ggc tgt cgc ace gcc gac ggc egg tgg gtg aaa atg 4800 
Arg Asp Val Gly Gly Cys Arg Thr Ala Asp Gly Arg Trp Val Lys Met 
1585 1590 1595 1600 

ggc gtc etg tac cgc ace aac aat etg cac gcg etc acc gac gcc gac 4848 
Gly Val Leu Tyr Arg Thr Asn Asn Leu His Ala Leu Thr Asp Ala Asp 
1605 1610 1615 

etg gcc aag etc cag cgc etc ggc ate egt aeg gac ttc gac etg cgc 4896 
Leu Ala Lys Leu Gin Arg Leu Gly He Arg Thr Asp Phe Asp Leu Arg 
1620 1625 1630 

atg ecg ggc gag cgc gee aaa gcc ccg aac cgc gtg ccc acg ggc gcg 4944 
Met Pro Gly Glu Arg Ala Lys Ala Pro Asn Arg Val Pro Thr Gly Ala 
1635 1640 1645 

cgc tat ate gte gcc gat gcc ttt gac gca cat etg egg gac ggg etg 4992 
Arg Tyr He Val Ala Asp Ala Phe Asp Ala His Leu Arg Asp Gly Leu 
1650 1655 1660 



agt ctg agc gag cgg gac ctg gag cgg ctg cgc gag gcc cgg ctg tcc 5040
Ser Leu Ser Glu Arg Asp Leu Glu Arg Leu Arg Glu Ala Arg Leu Ser
1665 1670 1675 1680



ggt ctc gac gag gcc gaa cag cac gcg ctc gta ctg gcc ctg gtc ctc 5088
Gly Leu Asp Glu Ala Glu Gln His Ala Leu Val Leu Ala Leu Val Leu
1685 1690 1695 

gcc gag acc teg gcc gtg etc ggc ggc cag gag acg cet ggc gag gag 5136 
Ala Glu Thr Ser Ala Val Leu Gly Gly Gin Glu Thr Pro Gly Glu Glu 
1700 1705 1710 



ccg cac ggc gag gag ggc cac cgc acg ttc aag gag atg ggc atc aac 5184
Pro His Gly Glu Glu Gly His Arg Thr Phe Lys Glu Met Gly Ile Asn
1715 1720 1725

teg ttg aac gcc gtc gaa ctg cgc aac cgc ctt ate gca gee acg gae 5232 
Ser Leu Asn Ala Val Glu Leu Arg Asn Arg Leu lie Ala Ala Thr Asp 
1730 1735 1740 

ctg egg ctt ccc gcc acg etc gtc tac gac tac ccc acg ccg aag gcc 5280 
Leu Arg Leu Pro Ala Thr Leu Val Tyr Asp Tyr Pro Thr Pro Lys Ala 
1745 1750 1755 1760 

gtc gtc cga ctc gta cgc gaa cga ctc gcg cga ccg gcc tcc ccc gca 5328
Val Val Arg Leu Val Arg Glu Arg Leu Ala Arg Pro Ala Ser Pro Ala 
1765 1770 1775 



cgc gat gtg gcc tcc gtc gtg gcc gag ttg gag tcc ctg ctg acg gcc 5376
Arg Asp Val Ala Ser Val Val Ala Glu Leu Glu Ser Leu Leu Thr Ala
1780 1785 1790

ggt gcg gag gtc tcg gag gag acc gcg gcg cgg ttg aaa gcg gtg acg 5424
Gly Ala Glu Val Ser Glu Glu Thr Ala Ala Arg Leu Lys Ala Val Thr
1795 1800 1805

gcg gtg tcg acg ggg acg acg ggg tcg ggc agc ggt acg ggc gcc ggc 5472
Ala Val Ser Thr Gly Thr Thr Gly Ser Gly Ser Gly Thr Gly Ala Gly
1810 1815 1820

tcc ggc ggg gct ctg gac ctg gta tcg gcc agt gac gag gaa ctg ttc 5520
Ser Gly Gly Ala Leu Asp Leu Val Ser Ala Ser Asp Glu Glu Leu Phe
1825 1830 1835 1840

cgg ctg atg gac gcg gag agc tga 5544
Arg Leu Met Asp Ala Glu Ser
1845
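
The pairing of codon rows and amino acid rows in the listing above can be checked mechanically. The following is a minimal Python sketch, assuming the standard genetic code and reading the character "e" that appears in some printed codons as "c". The function name translate is hypothetical, and only the first 16 codons of SEQ ID 1 are used.

```python
# Standard genetic code, codons enumerated in t, c, a, g order.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# One-letter to three-letter amino acid codes ("Ter" marks a stop codon).
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Ter",
}


def translate(dna):
    """Translate DNA (spaces allowed) into a list of three-letter codes."""
    seq = "".join(dna.lower().split())
    # Ignore an incomplete trailing codon, if any.
    codons = [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]
    return [THREE_LETTER[CODON_TABLE[c]] for c in codons]


# First 16 codons of SEQ ID 1 (nucleotides 1-48), with "ate" read as "atc".
FIRST_48 = "atg gtg ccc gtc cac aca gat gac tac gcg atc cag ccg ccc gcc gac"

if __name__ == "__main__":
    print(" ".join(translate(FIRST_48)))
```

Comparing the printed result with the first amino acid row of SEQ ID 2 is one way to spot character-recognition errors in either row.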



<210> 2 
<211> 1848 
<212> PRT 

<213> Streptomyces natalensis 



<400> 2 

Met Val Pro Val His Thr Asp Asp Tyr Ala Ile Gln Pro Pro Ala Asp
1 5 10 15

Thr Ala His Gly Gly Gly Gly Phe Thr Leu Pro Ala Val Phe Glu Ala 
20 25 30 

Ala Val Glu Ser Ala Pro Asp Ala Val Ala Leu Val Asp Gly Thr Val
35 40 45 

Pro Gly Pro Gly Arg Met Trp Arg Ala Asp Val Asp Ala Leu Ala Arg 
50 55 60 

Gly Leu Gln Glu Ser Gly Ile Ala Pro Gly Asp Val Val Ala Val Arg
65 70 75 80

Leu Pro Asn Cys Gly Arg Phe Pro Thr Leu His Leu Ala Val Ala Ala 
85 90 95 

Val Gly Ala Val Leu Leu Pro lie His Gin Gly Thr Pro Leu Pro Glu 
100 105 110 

Val Asp Ala Leu Leu Thr Arg Ala Glu Pro Ala Leu Leu Val Leu Ser 
115 120 125 
Ala Ala Gly Ser Asp Gly Leu Ala Thr Ala Arg Ser Leu Leu Glu Ser
130 135 140 

Val Pro Ser Leu Arg Gly Val Leu Leu Ala Gly Ala Ser Gly Asp Gly 
145 150 155 160

Glu Ser Gly Ser Val Gly Gly Gly Glu Ser Gly Ser Gly Arg Arg Ser 
165 170 175 

Leu Asp Gly Leu Leu Ala Gly Trp Ala Gly Ser Gly Pro Arg Pro Val 
180 185 190 

Asp Val Thr Pro Asp Met Pro Leu Val Leu Val Pro Ser Ser Gly Thr 
195 200 205 

Val Ser Ala Arg Pro Lys Leu Cys Val His Ser His Asp Gly Leu Leu 
210 215 220 

Ser Asn Thr Ala Ala Val Thr Ala Glu Ala Ala Asp Ala Phe Asp Gly 
225 230 235 240 

Pro Val Leu Thr Ala Cys Pro Met Thr His Leu Phe Gly Leu Gin Ser 
245 250 255 

Leu His Ala Ala Leu Phe Ala Ala Cys Thr Gin Val Leu Leu Thr Gly 
260 265 270 

Trp Asp Val Asp Arg Phe Leu Glu Gin Ala Arg Glu His Gly Pro Arg 
275 280 285 

Val Val Phe Ala Val Pro Ala Gin Leu Arg Asp Val Val Thr Arg Leu 
290 295 300 

Ala Arg Thr Gly Glu Pro Ala Gly Phe Thr Pro Tyr Gin Val Arg Thr 
305 310 315 320 

Ala Gly Ala Ala Val Ala Pro Ala Leu Ala Val Arg Val Arg Ala Val 
325 330 335 

Leu Asp Cys Glu Leu Val Val Val Trp Gly Met Ser Glu He Gly Thr 
340 345 350 

Gly Thr Arg Thr Arg Ala His His Pro Asp Gly Cys Val Gly Glu Pro 
355 360 365 

Val Ser Gly Val Asp Val Arg Val Val Asp Glu His Gly Gin Glu Cys 
370 375 380 

Ala Ala Asp Glu Arg Gly Glu Leu Gin Tyr Arg Gly Pro Gly Leu Phe 
385 390 395 400 

Arg Gly Tyr Phe Arg Glu Pro Glu Leu Thr Arg Ser Ala Leu Thr Asp 
405 410 415 

Asp Gly Trp Leu Arg Thr Gly Asp Leu Ala Thr Val Asp Ala Asp Gly
420 425 430

Val Val Val Leu His Gly Arg Ala Ala Glu Leu He Asn Thr Gly Gly 
435 440 445 

Arg Lys Phe Ser Ala Gly Glu Val Glu Gly Leu Leu Ser Gly Phe Thr 
450 455 460 

Asp Leu Gly Pro Leu Ala Val Val Gly Ala Pro Asp Asp Arg Leu Gly
465 470 475 480



Glu Tyr Pro Cys Leu Val Val Thr Asp His Ala Asp Gly Thr lie Gly 
485 490 495 

Leu Ser Glu Val Thr Ala Phe Leu Arg Arg Leu Gly Leu Ala Asp His 
500 505 510 

Lys lie Pro Leu Glu Leu Val Thr Val Arg Glu Leu Pro Phe Ser Pro 
515 520 525 

Ala Gly Lys Leu Asp Arg Gly Ala Leu Lys Arg Leu Leu Ala Asn Leu 
530 535 540 

Ala Glu Val Ser Val Pro Ala Arg Leu Gly Ala Val Pro Pro Tyr Thr 
545 550 555 560 

Ala Glu Glu Ala Leu Asp Leu Val Arg Asp Cys Val Gly Arg Val Leu 
565 570 575 

Arg Tyr Gly Gly Ala Ala Val Pro Phe Pro Pro Asp Lys Asp Phe Phe 
580 585 590 

Ser Pro Asp Lys Asp Phe Arg Gin Leu Gly Leu Asp Ser lie Gly Ala 
595 600 605 

Val Arg Leu Arg Asn Leu Leu Arg Glu Glu Thr Gly Leu Pro Leu Pro 
610 615 620 

Ala Thr Leu Ala Phe Asp Ser Pro Thr Pro Arg Ala Val Ala Arg Val 
625 630 635 640 

Leu Ala Glu Gin Glu Glu Pro Ser Gin Asp Glu Pro Arg Glu Asn Pro 
645 650 655 

Ala Asp Gly Ala Asp Pro Val Ala lie Val Gly Met Ala Cys Arg Leu 
660 665 670 

Pro Gly Gly Ala Asp Ser Pro Asp Ala Leu Trp Glu Leu Leu Ala Asp 
675 680 685 

Gly Thr Asp Ala Met Ser Pro Phe Pro Thr Asp Arg Gly Trp Asp Leu 
690 695 700 

Asp Arg Leu Phe Asp Glu Asp Ala Asp Arg Pro Gly Thr Ser Tyr Ala 
705 710 715 720 

Arg Glu Gly Gly Phe Leu His Asp Ala Gly Asp Phe Asp Ala Gly Phe 
725 730 735 

Phe Gly Leu Ser Asp Gin Glu Ala Thr Ala Thr Asp Pro Gin Gin Arg 
740 745 750 

Leu Leu Leu Glu Ala Ala Trp Glu Thr Phe Glu Arg Ala Gly lie Asp 
755 760 765 

Pro Gin Ser Leu Arg Gly Ser Arg Thr Gly Val Phe Thr Gly Ala Met 
770 775 780 

Asp Arg Gly Tyr Gly Thr Ser Ala Ser Ala Ala Pro Ser Ala Trp Glu 
785 790 795 800 

Ser Met Leu lie Thr Gly Thr Ala Gly Ser Ala Val Ser Gly Arg lie 
805 810 815 
Ala Tyr Thr Tyr Gly Leu Glu Gly Pro Ala Leu Thr Val Asp Thr Ala
820 825 830

Ser Ser Ser Ser Leu Val Ala Leu His Leu Ala Cys Arg Ser Leu Arg
835 840 845 

Ser Gly Glu Thr Asp Leu Ala Leu Ala Gly Gly Val Thr Val Met Ala 
850 855 860

Thr Pro Ala Pro Phe Ala His Phe Ser Arg Leu Arg Ala Leu Ser Pro
865 870 875 880

Asp Ser Arg Ser Met Ala Tyr Ala Asp Ala Ala Asn Gly Ser Ala Trp
885 890 895

Ser Glu Gly Ala Gly Leu Leu Leu Leu Glu Arg Leu Ser Asp Ala Arg
900 905 910

Arg Asn Gly His Arg Val Leu Ala Leu Val Arg Gly Ser Ala Val Asn 
915 920 925 

Gln Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Ser Gly Pro Ala Gln
930 935 940

Gln Arg Val Ile Arg Gln Ala Leu Ala Asp Ala Gly Leu Thr Pro Gln
945 950 955 960



Asp Val Asp Ala Val Glu Gly His Gly Thr Gly Thr Pro Leu Gly Asp
965 970 975

Pro Ile Glu Ala Gln Ala Leu Leu Ala Thr Tyr Gly Gln Gln Arg Pro
980 985 990 

Val Glu Arg Pro Leu Trp Leu Gly Ser Val Lys Ser Asn Phe Gly His 
995 1000 1005 

Thr Gln Ala Ala Ala Gly Val Val Gly Val Ile Lys Thr Val Leu Ala
1010 1015 1020



Leu Arg His Gly Val Leu Pro Gln Thr Leu His Val Asp Ala Pro Ser
1025 1030 1035 1040

Ala Lys Val Asp Trp Ser Ala Gly Ser Val Arg Leu Leu Thr Glu Ala
1045 1050 1055 

Arg Pro Trp Pro Arg Glu Ser Gly Arg Thr Arg Arg Ala Gly Val Ser 
1060 1065 1070 

Ser Phe Gly Leu Thr Gly Thr Asn Ala His Val Ile Leu Glu Glu Ala
1075 1080 1085

Pro Gly Glu Ala Ala Ala Gly Ala Arg Ala Glu Val Pro Glu Glu Ala
1090 1095 1100

Arg Cys Ala Ser Ser Pro Ala Arg Leu Pro Glu Pro Pro Gly Asp Ala
1105 1110 1115 1120



Ala Ala Pro Trp Val Leu Ser Ala Arg Ser Arg Ala Ala Leu Arg Ala
1125 1130 1135

Gln Ala Leu Arg Leu Ala Asp Gln Val Ala Ala Asp Pro Gly Leu Arg
1140 1145 1150

Ala Gln Asp Val Ala His Ala Leu Ala Thr Ser Arg Thr Leu His Arg
1155 1160 1165



His Arg Ala Val Val Ser Gly Ser Asp Arg Ala Gin Met Leu Ala Ala 
1170 1175 1180 

Ala Lys Arg Phe Gly Leu Gly Glu Arg Thr Ala Gly Val Thr Pro Asp 
1185 1190 1195 1200

Asp Ser Ala Pro Gly Leu Leu Ala Phe Val Phe Ser Gly Gin Gly Ser 
1205 1210 1215 

Gin Arg Ser Gly Met Gly Arg Ala Ala Ala Glu Ala Phe Pro Val Phe 
1220 1225 1230 

Gly Arg Ala Leu Gly Glu Val Cys Ala Ala Leu Asp Pro Leu Leu Thr 
1235 1240 1245 

Arg Pro Leu Thr Ser Val Met Trp Ala Ala Pro Gly Ser Glu Glu Ala 
1250 1255 1260 

Ala Arg Leu Asp Asp Thr Thr Tyr Thr Gin Pro Ala Leu Phe Ala Val 
1265 1270 1275 1280

Gin Val Ala Leu Tyr Arg Leu Phe Glu Ser Trp Gly Val Val Pro Asp 
1285 1290 1295 

Gin Leu Val Gly His Ser Val Gly Glu lie Ser Ala Ala His Val Ala 
1300 1305 1310 

Gly Val Leu Gly Leu Arg Asp Ala Cys Thr Leu Val Ala Ala Arg Ser 
1315 1320 1325 

Arg Leu Met Gly Ala Leu Pro Pro Gly Gly Ala Met Val Ala Val Arg 
1330 1335 1340 

lie Thr Glu Pro Glu Val Thr Pro Trp Leu Ala Glu Leu Thr Asp Glu 
1345 1350 1355 1360

Val Ser lie Ala Ala Val Asn Gly Pro His Ser Leu Val Leu Ala Gly 
1365 1370 1375 

Ala Glu Ala Pro Leu Val Ala Leu Thr Asp Arg Leu Ala Ala Ala Gly 
1380 1385 1390 



His Lys Thr Arg Arg Leu Met Val Ser Thr Ala Pro His Ser Pro Leu 
1395 1400 1405 

Met Asp Pro Met Leu Glu Glu Phe Arg Ala Val Val Arg Thr Leu Ser 
1410 1415 1420 

Tyr Ala Ala Pro Ala Val Pro Leu Val Ser Thr Val Thr Gly Arg Pro 
1425 1430 1435 1440

Leu Thr Gly Glu Glu Ala Arg Asp Pro Asp His Trp Val Arg His Val 
1445 1450 1455 

Arg Gln Ser Val Arg Phe Lys Asp Ala Ile Gly Arg Leu Arg Asp Glu
1460 1465 1470



Arg Val Thr Gly Phe Leu Glu Leu Gly Ala Glu Pro Ala Leu Thr Pro 
1475 1480 1485 

Met lie Asp Glu Cys Leu Glu Ser Ala Asp Gly Gin Pro Gly Thr Ala 
1490 1495 1500 

Leu Val Pro Ser Leu Arg Ala Gly Val Pro Glu Arg Asp Ala Leu Leu 



1505 1510 1515 1520

Thr Ala Val Ala Arg Val His Ala Gin Gly Val Pro Val Asp Trp Asp 
1525 1530 1535 

Ala Val Leu Pro Gly Ala Glu Ala Ser Val Thr Val Arg Gly Leu Pro 
1540 1545 1550 

Ala Ala Asp Arg Gin Trp Phe Arg Phe Val Pro Asp Gin Gly Ala Pro 
1555 1560 1565 

Leu Thr Leu Ala Asp Arg Ser Leu His Leu Glu Gly Ala Ala His Leu 
1570 1575 1580 

Arg Asp Val Gly Gly Cys Arg Thr Ala Asp Gly Arg Trp Val Lys Met 
1585 1590 1595 1600

Gly Val Leu Tyr Arg Thr Asn Asn Leu His Ala Leu Thr Asp Ala Asp 
1605 1610 1615 

Leu Ala Lys Leu Gin Arg Leu Gly He Arg Thr Asp Phe Asp Leu Arg 
1620 1625 1630 

Met Pro Gly Glu Arg Ala Lys Ala Pro Asn Arg Val Pro Thr Gly Ala 
1635 1640 1645 

Arg Tyr He Val Ala Asp Ala Phe Asp Ala His Leu Arg Asp Gly Leu 
1650 1655 1660 

Ser Leu Ser Glu Arg Asp Leu Glu Arg Leu Arg Glu Ala Arg Leu Ser 
1665 1670 1675 1680

Gly Leu Asp Glu Ala Glu Gin His Ala Leu Val Leu Ala Leu Val Leu 
1685 1690 1695 

Ala Glu Thr Ser Ala Val Leu Gly Gly Gin Glu Thr Pro Gly Glu Glu 
1700 1705 1710 

Pro His Gly Glu Glu Gly His Arg Thr Phe Lys Glu Met Gly He Asn 
1715 1720 1725 

Ser Leu Asn Ala Val Glu Leu Arg Asn Arg Leu Ile Ala Ala Thr Asp
1730 1735 1740 

Leu Arg Leu Pro Ala Thr Leu Val Tyr Asp Tyr Pro Thr Pro Lys Ala 
1745 1750 1755 1760

Val Val Arg Leu Val Arg Glu Arg Leu Ala Arg Pro Ala Ser Pro Ala 
1765 1770 1775 

Arg Asp Val Ala Ser Val Val Ala Glu Leu Glu Ser Leu Leu Thr Ala 
1780 1785 1790 

Gly Ala Glu Val Ser Glu Glu Thr Ala Ala Arg Leu Lys Ala Val Thr 
1795 1800 1805 



Ala Val Ser Thr Gly Thr Thr Gly Ser Gly Ser Gly Thr Gly Ala Gly
1810 1815 1820 

Ser Gly Gly Ala Leu Asp Leu Val Ser Ala Ser Asp Glu Glu Leu Phe 
1825 1830 1835 1840

Arg Leu Met Asp Ala Glu Ser 
1845 






<210> 3 
<211> 20394 
<212> DNA 

<213> Streptomyces natalensis

<220> 

<221> CDS 

<222> (1)..(20394)



<400> 3 

atg tcg aac gag gag aag ctg cgc gag tac ctc aag cgc gcg atc gcg 48
Met Ser Asn Glu Glu Lys Leu Arg Glu Tyr Leu Lys Arg Ala Ile Ala
1 5 10 15

gac ctt cac gag act cgt cag caa ttg gac gag acc gag gcg aag cag 96
Asp Leu His Glu Thr Arg Gln Gln Leu Asp Glu Thr Glu Ala Lys Gln
20 25 30

cga gag ccc ctc gcg atc gtg tcg atg gcc tgc cgc ttc ccc ggc ggc 144
Arg Glu Pro Leu Ala Ile Val Ser Met Ala Cys Arg Phe Pro Gly Gly
35 40 45

gtc cgt tcg ccc gag gag ctg tgg gag ctg ctg cgc gac ggc gtc gac 192
Val Arg Ser Pro Glu Glu Leu Trp Glu Leu Leu Arg Asp Gly Val Asp
50 55 60

gcg gtt tcc tcc ttc ccc cgt aac cgc ggc tgg gac ctg gac gcg ctc 240
Ala Val Ser Ser Phe Pro Arg Asn Arg Gly Trp Asp Leu Asp Ala Leu
65 70 75 80

tac cac tcc gac ccg gcc cac cag ggc acc agc tat gcg cgc gag ggc 288
Tyr His Ser Asp Pro Ala His Gln Gly Thr Ser Tyr Ala Arg Glu Gly
85 90 95

gga ttc ctg cat gac gcg ggc gag ttc gac ccc ggc ttc ttc ggg atc 336
Gly Phe Leu His Asp Ala Gly Glu Phe Asp Pro Gly Phe Phe Gly Ile
100 105 110

tcc ccg cgc gag gcg ctc gcc atg gac ccc cag cag cgg ctg ctg ctg 384
Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gln Gln Arg Leu Leu Leu
115 120 125

gag acc gca tgg gaa gcc gtc gag cgg gcc ggt atc gac ccg gag tcc 432
Glu Thr Ala Trp Glu Ala Val Glu Arg Ala Gly Ile Asp Pro Glu Ser
130 135 140

ctc gcg ggc agc cga acg ggt gtc ttt gtc ggc acc ggg cac gga ggg 480
Leu Ala Gly Ser Arg Thr Gly Val Phe Val Gly Thr Gly His Gly Gly
145 150 155 160

tac gac gcc gag ggc cga cgg cgt gcc gac gag gtc ggc ggg cac ttg 528
Tyr Asp Ala Glu Gly Arg Arg Arg Ala Asp Glu Val Gly Gly His Leu
165 170 175

ntg acg ggc aat cac atc agc atc gcc tcc ggc cgg att tcg tat gtc 576
Leu Thr Gly Asn His Ile Ser Ile Ala Ser Gly Arg Ile Ser Tyr Val
180 185 190



ctg ggg ctg gaa ggc cct gcc ctg acc gtg gac acg gcc tgc tcc tcg 624
Leu Gly Leu Glu Gly Pro Ala Leu Thr Val Asp Thr Ala Cys Ser Ser
195 200 205

tcg ctg gtc gcc ctg cat ctg gcg atg cac gcg ctg cgg cgc gac gaa 672
Ser Leu Val Ala Leu His Leu Ala Met His Ala Leu Arg Arg Asp Glu
210 215 220



aT» wT» ?Tr 72^ T ^ ?n cag cgc gcc gcc gac egg 

Ala His Ala Leu Leu Ala Thr Tyr Gly Gin Gin Arg Ala Ala Asp Arg 
355 360 365 ^ ^ 

ccg tgc gga etc gge tec atg aag tee aac ate ggg cac ace cag gcc 
Pro Cys Gly Leu Gly Ser Met Lys Ser Asn He Gly His Thr GlI Ala 
370 375 



380 



gcc gcc ggt ate gee gge gtg atg aaa atg gtc ctg gcg atg egg cac 
Ala Ala Gly He Ala Gly Val Met Lys Met Val Leu All Met I?| His 



395 400 



415 



816 



tgc gcc atg gcc ctg gtg gge gge gcg acc gtg atg tec aca cca caa 7Pn 
cys Ala Met Ala Leu Val Gly Gly All Thr Val Met Ser ?S Pro G?I 

230 235 240 

atg ttc gtg gag tte tec cgc cag cgc gge ctt gcc cce aac aac nac ie:a 
Met Phe val Glu Phe Ser Arg GlI ijg Ify Leu III SI If? I?g 

245 250 255 

tgc aag ccg ttc gcg gcc gcc gcc gac gge acc gge tgg age gag ggt 
Cys Lys Pro Phe Ala Ala Ala Ala Asp Gly Thr Gly sir flu Gly 
2S0 265 270 

fr^i f^"" 3tc gag egg etc agt gac gcc gta cgc aac gge 864 

val Gly Leu Leu Leu Val Glu Arg Leu Ser Asp Ala Val Alg Asn Gly 
275 280 28^ 

tat ccc gtc etc gee gtg ctg aag gge teg gcc gtc aac cag gac gge 912 
^ oon Ala Val Asn GlI lap My 

290 295 300 

gcg tec aac gge ctg acc gcc ccc aac gge ccc teg cag caa cgc gtc 960 
Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ser Gin Gin Aig Val 

310 315 320 

ate cgc cag gcg ctg acc gge gcg gge etc gcc gcc teg gac ate gac 1008 
He Arg Gin Ala Leu Thr Gly Ala Gly Leu Ala Ala Se? Lp He Asp 
325 330 

f?*^ fr^f 2?^ ?f3 cac gge acc gge acc acc etc gge gac ccc gtc gag 1056 
Ala val Glu Ala His Gly Thr Gly Thr Thr Leu Gly Asp Pro Val Glu 
340 345 350 

I?f H?" flf. flf, f?f ^^Z ^^1 H??. 2?? =?9 9ac egg 1104 

365 



1152 



1200 



ggg cac ctg ccc agg acc ctg cac ctg gac gag ccc acc qqq cac ate 124 8 
Gly His Leu Pro Arg Thr Leu His Leu Lp Glu Pro Thr Gl? Hit vll 
405 410 



gac tgg age gag gge aac gcc agg etc etc gcg gag ccc gag ccc tgg 1296 
Asp Trp ser Glu Gly Asn Ala Arg Leu Leu Ala GlS Pro Glu Pro tS 
*20 425 430 

ccg age gcc gge egg ccc cgt cgc gcc gcc gtc tec tec ttc gge ate 1344 
Pro Ser Ala Gly Arg Pro Arg Arg Ala Ala Val Se r Ser Phe Gly He 

435 440 '■ — 445 i . . 

age gge acc aac gcc cac gtc ate ctg gag cag gcg ccc acc cac oaa i-iQ^ 
ser Gly Thr Asn Ala His Val He Leu Glu Gin SI PrI A?I ult l!2 
450 455 



Sf^ ^'^^ Sf^^ «^°9 gcc cgg ccg ggc gcg ctg ccc tgg 1440
Ala Glu Pro Ala Pro Glu Pro Ala Ala Arg Pro Gly Ala Leu Pro Trp
470 475 480

atc ctg tcc gcc cgc acc gaa gcg ggc ctg cgt gcc cag gcc gac cgc 1488
Ile Leu Ser Ala Arg Thr Glu Ala Gly Leu Arg Ala Gln Ala Asp Arg
485 490 495

ctc ggc cgc cac cta cgg gac cgc gcc gac ctc gaa ccg gcc gcc gtc 1536
Leu Gly Arg His Leu Arg Asp Arg Ala Asp Leu Glu Pro Ala Ala Val
500 505 510

gcc cat gcg ctc gcg gac acc cgg acc ctc atg gaa cac cgc gcg gta 1584
Ala His Ala Leu Ala Asp Thr Arg Thr Leu Met Glu His Arg Ala Val
515 520 525

gtc gtc gcg ggc gac cgg gag gag ttc ctg cgc ggc ctg gac gcc ctc 1632
Val Val Ala Gly Asp Arg Glu Glu Phe Leu Arg Gly Leu Asp Ala Leu
530 535 540

gcc gcg ggc cgc acc gcc aac ggc ctg gtc agc ggc gtc gcc gtc aag 1680
Ala Ala Gly Arg Thr Ala Asn Gly Leu Val Ser Gly Val Ala Val Lys
545 550 555 560

gcc gcc agc gcg ttc ctc ttc gcc ggg cag ggc tcc cag cga ccg ggc 1728
Ala Ala Ser Ala Phe Leu Phe Ala Gly Gln Gly Ser Gln Arg Pro Gly
565 570 575

atg ggg cgc gaa ctg cac gcc gcg cac ccc gtg ttc gcc acg gcc ttc 1776
Met Gly Arg Glu Leu His Ala Ala His Pro Val Phe Ala Thr Ala Phe
580 585 590

gac gcg gtg tgc gcc gaa ctg gac cca cac ctg gac cgg ccg ctg cgc 1824
Asp Ala Val Cys Ala Glu Leu Asp Pro His Leu Asp Arg Pro Leu Arg
595 600 605

gac atc gtc ttc gcc gag gaa ggc agc gcc gag gcc gcc ctg ctc gac 1872
Asp Ile Val Phe Ala Glu Glu Gly Ser Ala Glu Ala Ala Leu Leu Asp
610 615 620

cag acc gcc tac acc cag gcc gcg ctc ttc gcc ctg gaa acc gcc ctg 1920
Gln Thr Ala Tyr Thr Gln Ala Ala Leu Phe Ala Leu Glu Thr Ala Leu
625 630 635 640

ttc cgg ctc gtc gaa tcc tgg ggc gtg gca ccc cgg ttc gtc gcc gga 1968
Phe Arg Leu Val Glu Ser Trp Gly Val Ala Pro Arg Phe Val Ala Gly
645 650 655

cac tcc atc ggc gag ctg acc gcc gcc cac gtc agt ggc gtg ctg acc 2016
His Ser Ile Gly Glu Leu Thr Ala Ala His Val Ser Gly Val Leu Thr
660 665 670

ctc cac gac gcc gca cgg ctg gtc gcc gcg cgc ggc acc ctc atg cag 2064
Leu His Asp Ala Ala Arg Leu Val Ala Ala Arg Gly Thr Leu Met Gln
675 680 685

gcg ctg ccc gca ggc ggc gcc atg gtg gcg gtc cag gcc acc gag gac 2112
Ala Leu Pro Ala Gly Gly Ala Met Val Ala Val Gln Ala Thr Glu Asp
690 695 700

gag atc cgc gag cgt ctc gcc ggc cac gag gac cac gtc gcc ctc gcg 2160
Glu Ile Arg Glu Arg Leu Ala Gly His Glu Asp His Val Ala Leu Ala
705 710 715 720

gcc gcc aac ggg ccc gat tcc acc gtc att tcg ggc gac gaa cag gcc 2208
Ala Ala Asn Gly Pro Asp Ser Thr Val Ile Ser Gly Asp Glu Gln Ala
725 730 735

v^? ^S"" T^"" f?'' ""^^ ggc cgc cgc acc aag 2256
Val Thr Glu Ile Ala Ala His Trp Glu Ala Gln Gly Arg Arg Thr Lys
740 745 750

cgg ctg cgg gtc agc cac gcc ttc cac tcc ccg cac atg gac aae ata 2304
Arg Leu Arg Val Ser His Ala Phe His Ser Pro His Met Asp Asp Met
755 760 765

ctg gag gac ttc cgg cgc gtc gcc cgc ggt ctg acc ttc cac gcc ccc 2352
Leu Glu Asp Phe Arg Arg Val Ala Arg Gly Leu Thr Phe His Ala Pro
770 775 780

cgc atc ccc gtg gtg tcc acg gtg acc ggc gcg ctc gcc acc gaa aac 2400
Arg Ile Pro Val Val Ser Thr Val Thr Gly Ala Leu Ala Thr Glu Asp
785 790 795 800

gaa ctg cgc tcg ccc gac tac tgg gtg cgg cag gtc cgc gaa acc ate 2448
Glu Leu Arg Ser Pro Asp Tyr Trp Val Arg Gln Val Arg Glu Thr Val
805 810 815

cgc ttc tgt gct gcc gtg cgc acc ctt gag gcc gag ggc gtc acc acc 2496
Arg Phe Cys Ala Ala Val Arg Thr Leu Glu Ala Glu Gly Val Thr Thr
820 825 830

ttc gtg gag atc ggc acc ggc ggc gtc ctc acc ccc atg gtc cag gac 2544
Phe Val Glu Ile Gly Thr Gly Gly Val Leu Thr Pro Met Val Gln Asp
835 840 845

tgt ctg acc acc ctc gaa gag ccc gtt ctc gtc cct ctg ctg cgc acc 2592
Cys Leu Thr Thr Leu Glu Glu Pro Val Leu Val Pro Leu Leu Arg Thr
850 855 860

ggc cgc ccc gaa acc gtc gct ctc acc gag ggc gtc gcc acc gcc ttc 2640
Gly Arg Pro Glu Thr Val Ala Leu Thr Glu Gly Val Ala Thr Ala Phe
865 870 875 880

gtg cac ggt gtc ccc gtc gac cgg tcc gcc ttc ccg ggc gcg ccc ggt 2688
Val His Gly Val Pro Val Asp Arg Ser Ala Phe Pro Gly Ala Pro Gly
885 890 895

acc tcc cgc gcg gac ctg ccc acc tac gcc ttc cag cgt cag tgg tac 2736
Thr Ser Arg Ala Asp Leu Pro Thr Tyr Ala Phe Gln Arg Gln Trp Tyr
900 905 910

tgg ctg gac ccg gcc gac cac gac gag ggg gag gcg gcc gcc gcc gaa 2784
Trp Leu Asp Pro Ala Asp His Asp Glu Gly Glu Ala Ala Ala Ala Glu
915 920 925

gcg ggc gag gcc gga ttc tgg gcg gcc gtc gaa cgc gag gac ctc cag 2832
Ala Gly Glu Ala Gly Phe Trp Ala Ala Val Glu Arg Glu Asp Leu Gln
930 935 940

gag ctg tcg gcc gtc ctg gcc atc gac ggc agc gaa gcg gac tcc ctc 2880
Glu Leu Ser Ala Val Leu Ala Ile Asp Gly Ser Glu Ala Asp Ser Leu
945 950 955 960



ggc agc ttc ctg ccc acc ctc tcc tcc tgg cgc agg cag cgc agg acc 2928
Gly Ser Phe Leu Pro Thr Leu Ser Ser Trp Arg Arg Gln Arg Arg Thr
965 970 975

^Src tac cgc acc cac tgg gcc ccg cgc 2976
Gln Ala Ala Ala Asp Arg Phe Ser Tyr Arg Thr His Trp Ala Pro Arg
980 985 990

acc gcc tcg ggc ggc ccc acc gcc acc ggg cac tgg ctc gtc gtc ctg 3024
Thr Ala Ser Gly Gly Pro Thr Ala Thr Gly His Trp Leu Val Val Leu
995 1000 1005



ccc gaa ggc ggc acc gac gac ccg tgg acc gcc cgc ctc ctg gac gcg 3072
Pro Glu Gly Gly Thr Asp Asp Pro Trp Thr Ala Arg Leu Leu Asp Ala
1010 1015 1020

ctg aac gac cag ggc ctg cac acc gac gta cgc gaa ctg ccc gcc gac 3120
Leu Asn Asp Gln Gly Leu His Thr Asp Val Arg Glu Leu Pro Ala Asp
1025 1030 1035 1040

cac gag ccc gac gcc tgg ggc cga cac ccc gtg gac ggc gtg ctc tgt 3168
His Glu Pro Asp Ala Trp Gly Arg His Pro Val Asp Gly Val Leu Cys
1045 1050 1055

ctg ctg gca ctc gac gag cgg ccc acc cgc tcc tgc cct ccg tac cgg 3216
Leu Leu Ala Leu Asp Glu Arg Pro Thr Arg Ser Cys Pro Pro Tyr Arg
1060 1065 1070

cgc ggg ctg gcc gcc acc acc aac gct gct gcg cgc cct gag ggc gcg 3264
Arg Gly Leu Ala Ala Thr Thr Asn Ala Ala Ala Arg Pro Glu Gly Ala
1075 1080 1085

ggc atc cag gca ccg ctg tgg tgc gtg acc cgc ggc gcc gtc gcc gtc 3312
Gly Ile Gln Ala Pro Leu Trp Cys Val Thr Arg Gly Ala Val Ala Val
1090 1095 1100

gac cgg cac gag gcg ctc aag agc ccc tta cag gca cag aca tgg ggc 3360
Asp Arg His Glu Ala Leu Lys Ser Pro Leu Gln Ala Gln Thr Trp Gly
1105 1110 1115 1120

ctg ggc cgg gtg gcc gcc ctg gag tcc ccg cag agc tgg ggc ggg ctc 3408
Leu Gly Arg Val Ala Ala Leu Glu Ser Pro Gln Ser Trp Gly Gly Leu
1125 1130 1135

atc gac ctg ccc gac aac ctg gac gga cgg gcc gtc tcc gcg ctg ctg 3456
Ile Asp Leu Pro Asp Asn Leu Asp Gly Arg Ala Val Ser Ala Leu Leu
1140 1145 1150

agc acc ctc gcc ggg gag gag gac cag gtc gcc gtc cgc ccc gcc ggg 3504
Ser Thr Leu Ala Gly Glu Glu Asp Gln Val Ala Val Arg Pro Ala Gly
1155 1160 1165

gtc ttc gcc cgc cgc ctg gaa cgg atc aca ccc ggc ggc gac acc ggc 3552
Val Phe Ala Arg Arg Leu Glu Arg Ile Thr Pro Gly Gly Asp Thr Gly
1170 1175 1180

gac cgg tgg agc acc cac ggc acc gtc ctg gtc acc ggc ggc acc ggt 3600
Asp Arg Trp Ser Thr His Gly Thr Val Leu Val Thr Gly Gly Thr Gly
1185 1190 1195 1200

gcc ctc ggc gcg cac ctc gcc cac tgg ctg gcc gac gcc gga gcc gaa 3648
Ala Leu Gly Ala His Leu Ala His Trp Leu Ala Asp Ala Gly Ala Glu
1205 1210 1215



cac ctc gtg ctc acc ggc cgc cgc ggc ccg cag gcc ccc ggc gca ccg 3696
His Leu Val Leu Thr Gly Arg Arg Gly Pro Gln Ala Pro Gly Ala Pro
1220 1225 1230

gaa ctc gcg gcc gcc ctc acc gac cgg ggc gtc aag gtc acc ctc gcc 3744
Glu Leu Ala Ala Ala Leu Thr Asp Arg Gly Val Lys Val Thr Leu Ala
1235 1240 1245

gcc tgc gac gcc gcc gac cgt gat gcg ctg gcg gcc gtc ctc gcg gac 3792
Ala Cys Asp Ala Ala Asp Arg Asp Ala Leu Ala Ala Val Leu Ala Asp
1250 1255 1260

atc ccg ccg cac ctg ccg ctg acc ggc gtc gtc cac gcc gcg ggc gta 3840
Ile Pro Pro His Leu Pro Leu Thr Gly Val Val His Ala Ala Gly Val
1265 1270 1275 1280

ctg gac gac ggc gta ctg gac gcg ctc acc ccc gag cgc ttc gag acc 3888
Leu Asp Asp Gly Val Leu Asp Ala Leu Thr Pro Glu Arg Phe Glu Thr
1285 1290 1295

gta ctg cgc ccc aag gcg cgg gcc gca cag aac ctg cac gaa ctc acc 3936
Val Leu Arg Pro Lys Ala Arg Ala Ala Gln Asn Leu His Glu Leu Thr
1300 1305 1310



cag gac ctc gac ctg gac cac ttc gtg ctg ttc tcc tcg atc gtc ggc 3984
Gln Asp Leu Asp Leu Asp His Phe Val Leu Phe Ser Ser Ile Val Gly
1315 1320 1325

gtc ctg ggc aac gcc gga cag gcc aac tac gcc gcc gcc aac gcc tac 4032
Val Leu Gly Asn Ala Gly Gln Ala Asn Tyr Ala Ala Ala Asn Ala Tyr
1330 1335 1340

ttg gac gcc ctc gcc gaa cac cgt ctc gcc cag ggg ctc ccg gcc acc 4080
Leu Asp Ala Leu Ala Glu His Arg Leu Ala Gln Gly Leu Pro Ala Thr
1345 1350 1355 1360

tcc gtg tcc tgg ggc cct ggg cag gcg gcg gca tgg cac gac agc gac 4128
Ser Val Ser Trp Gly Pro Gly Gln Ala Ala Ala Trp His Asp Ser Asp
1365 1370 1375

gcc gcc gac cgg atg agc cgc gac gga ctg ctg ccc atg gcc gcg gcc 4176
Ala Ala Asp Arg Met Ser Arg Asp Gly Leu Leu Pro Met Ala Ala Ala
1380 1385 1390

ccg cgt cgc cgc cct gcg cca gcc ctc gcc cag ggc atg aca cag gtg 4224
Pro Arg Arg Arg Pro Ala Pro Ala Leu Ala Gln Gly Met Thr Gln Val
1395 1400 1405

acc gtg gcc gac atc gac tgg agc gca tac gcc ccc gcc ctg acc gcc 4272
Thr Val Ala Asp Ile Asp Trp Ser Ala Tyr Ala Pro Ala Leu Thr Ala
1410 1415 1420

gtc cgc ccc agc ccc ctc atc ggc gac ctg ccc gag gca cgc cgc gcg 4320
Val Arg Pro Ser Pro Leu Ile Gly Asp Leu Pro Glu Ala Arg Arg Ala
1425 1430 1435 1440

ctc ggc ccc gca gaa ggc ccc cgc cgg gaa cgc tcc ccc ctg cgc gac 4368
Leu Gly Pro Ala Glu Gly Pro Arg Arg Glu Arg Ser Pro Leu Arg Asp
1445 1450 1455

cgg atc ggc gca ctg ccg ccc gcc gaa cag gaa aag gca ttc ctg acc 4416
Arg Ile Gly Ala Leu Pro Pro Ala Glu Gln Glu Lys Ala Phe Leu Thr
1460 1465 1470

atg gtc agg gaa gag gcc gcg agg gta ctg gga cac ccc tcg ccg gac 4464
Met Val Arg Glu Glu Ala Ala Arg Val Leu Gly His Pro Ser Pro Asp
1475 1480 1485

acc gtc gat gcc caa cgc gcc ttc cgc gag cag ggg ttc gac tcc ctg 4512
Thr Val Asp Ala Gln Arg Ala Phe Arg Glu Gln Gly Phe Asp Ser Leu
1490 1495 1500

atg gcc gtc gac ctg cgc aac cgg ctc tcc gcc gcg acg ggc ctg cgg 4560
Met Ala Val Asp Leu Arg Asn Arg Leu Ser Ala Ala Thr Gly Leu Arg
1505 1510 1515 1520

ctg ccc gcc acc ctg ctg ttc gac cac ccc acc ccc ctt gcg gcc gcc 4608
Leu Pro Ala Thr Leu Leu Phe Asp His Pro Thr Pro Leu Ala Ala Ala
1525 1530 1535

gcc tgc ctg cgc tcc gaa gtc ctg ggc gcc gca gga ccc gcc acg gtc 4656
Ala Cys Leu Arg Ser Glu Val Leu Gly Ala Ala Gly Pro Ala Thr Val
1540 1545 1550

gtt cag gca tcg acc gcc gcc ctc gac gaa ccg gtg gcg atc atc ggc 4704
Val Gln Ala Ser Thr Ala Ala Leu Asp Glu Pro Val Ala Ile Ile Gly
1555 1560 1565

atg gcc tgc cgc ttc ccc ggc ggc gtg cac tca ccc gag gcc ctg tgg 4752
Met Ala Cys Arg Phe Pro Gly Gly Val His Ser Pro Glu Ala Leu Trp
1570 1575 1580

cgg ctg ctg gcc gag ggc ggc gac gcc atc acc ccc atg ccc gcc gac 4800
Arg Leu Leu Ala Glu Gly Gly Asp Ala Ile Thr Pro Met Pro Ala Asp
1585 1590 1595 1600

cgg ggc tgg gac ctg gac cgg ctc tac cac ccc gac ccc gac cac cag 4848
Arg Gly Trp Asp Leu Asp Arg Leu Tyr His Pro Asp Pro Asp His Gln
1605 1610 1615

ggc acc agc tac gcc cgc ggc ggc ggc ttc ctg gac ggc gcg gcc gac 4896
Gly Thr Ser Tyr Ala Arg Gly Gly Gly Phe Leu Asp Gly Ala Ala Asp
1620 1625 1630

ttc gac gcg gac ttc ttc ggc atc tcg ccg cgc gag gcc ctc gcc atg 4944
Phe Asp Ala Asp Phe Phe Gly Ile Ser Pro Arg Glu Ala Leu Ala Met
1635 1640 1645

gac ccg cag cag cgg ctg ctc ctg gaa aca tgg gag gtg ctc gaa cag 4992
Asp Pro Gln Gln Arg Leu Leu Leu Glu Thr Trp Glu Val Leu Glu Gln
1650 1655 1660

gcg ggg atc gac ccg gaa tcc ctg cgg ggc agc agc acc ggt gtc ttc 5040
Ala Gly Ile Asp Pro Glu Ser Leu Arg Gly Ser Ser Thr Gly Val Phe
1665 1670 1675 1680

gcg ggc acc aac acc cag gac tac ggc acg gcc ctg gac gcg gca cag 5088
Ala Gly Thr Asn Thr Gln Asp Tyr Gly Thr Ala Leu Asp Ala Ala Gln
1685 1690 1695

gac gaa gcc ggc gga cac cgg ctc acc ggc aac gcg atg agc gtc gtc 5136
Asp Glu Ala Gly Gly His Arg Leu Thr Gly Asn Ala Met Ser Val Val
1700 1705 1710

tcc ggc cgg gtc tcc tac acc ttc ggc ttc gag gga ccg gcc ctc acc 5184
Ser Gly Arg Val Ser Tyr Thr Phe Gly Phe Glu Gly Pro Ala Leu Thr
1715 1720 1725

gtg gac acg gcg tgc tcc tcc tcg ctg gtg gcc ctg cac atg gcc gcg 5232
Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Leu His Met Ala Ala
1730 1735 1740



cag gcg ctg cgc cag ggc gaa tgc tcc ctg gcg gtc gcg ggc ggt gtg 5280
Gln Ala Leu Arg Gln Gly Glu Cys Ser Leu Ala Val Ala Gly Gly Val
1745 1750 1755 1760

acg gtg atg gcc acc ccg tcc tcc ttc gtg gag ttc gcc cgg cag cgc 5328
Thr Val Met Ala Thr Pro Ser Ser Phe Val Glu Phe Ala Arg Gln Arg
1765 1770 1775

ggg ctg gcc ccc gac ggc cgc tgc aag ccg ttc gcg gcg gcc gcc gac 5376
Gly Leu Ala Pro Asp Gly Arg Cys Lys Pro Phe Ala Ala Ala Ala Asp
1780 1785 1790

ggc acc ggc tgg agc gag ggc gtc ggc ctg ctg ctc gtg gaa cgg ctc 5424
Gly Thr Gly Trp Ser Glu Gly Val Gly Leu Leu Leu Val Glu Arg Leu
1795 1800 1805

agc gac gcc cgc cga aac ggc cac cag gtg ctc gcc gtc gtc cgc ggt 5472
Ser Asp Ala Arg Arg Asn Gly His Gln Val Leu Ala Val Val Arg Gly
1810 1815 1820

tcg gcg gtc aac cag gac ggc gcg tcc aac ggt ctg agc gca ccc agc 5520
Ser Ala Val Asn Gln Asp Gly Ala Ser Asn Gly Leu Ser Ala Pro Ser
1825 1830 1835 1840

ggc ccg tcc cag cag cgg gtg atc cgg cag gcc ctg gcg aac gcc cgg 5568
Gly Pro Ser Gln Gln Arg Val Ile Arg Gln Ala Leu Ala Asn Ala Arg
1845 1850 1855

gtg gcc gcc tcc gag gtc gac gcc gtg gag gcc cac ggc acg ggc acc 5616
Val Ala Ala Ser Glu Val Asp Ala Val Glu Ala His Gly Thr Gly Thr
1860 1865 1870

acg ctc ggt gac ccg atc gag gcc cag gcg ctg ctg gcc acc tac ggc 5664
Thr Leu Gly Asp Pro Ile Glu Ala Gln Ala Leu Leu Ala Thr Tyr Gly
1875 1880 1885

cag gag cgg ccg ctg ctg ctc ggc gcg gtg aag tcc aac ctc ggc cac 5712
Gln Glu Arg Pro Leu Leu Leu Gly Ala Val Lys Ser Asn Leu Gly His
1890 1895 1900

acc cag gcc gcc gcc ggt gtg gcg ggc gtg atg aag atg gtg ctg gcg 5760
Thr Gln Ala Ala Ala Gly Val Ala Gly Val Met Lys Met Val Leu Ala
1905 1910 1915 1920

atg cgg cac ggc atg ctg ccg cgc acc ctg cac gtc gac gag ccc acc 5808
Met Arg His Gly Met Leu Pro Arg Thr Leu His Val Asp Glu Pro Thr
1925 1930 1935

ggg cat gtc gac tgg acc gcg ggc gcg gtc gag ctg ctc acc gag cac 5856
Gly His Val Asp Trp Thr Ala Gly Ala Val Glu Leu Leu Thr Glu His
1940 1945 1950

acg gac tgg ccc gag acc ggc cac ccc cgg cgc gcc gcg gtc tcc gcg 5904
Thr Asp Trp Pro Glu Thr Gly His Pro Arg Arg Ala Ala Val Ser Ala
1955 1960 1965

ttc ggc atc agc ggc acc aat gcg cac gtg gtg ctg gaa ctg ccc gca 5952
Phe Gly Ile Ser Gly Thr Asn Ala His Val Val Leu Glu Leu Pro Ala
1970 1975 1980

gcc gaa cag ccc ttg gtc gaa cag ccc tcg gcc gcg gag ccc gac gcg 6000
Ala Glu Gln Pro Leu Val Glu Gln Pro Ser Ala Ala Glu Pro Asp Ala
1985 1990 1995 2000

ccg gcc acc gct ccc gac cgg acg ccc acc gcc tcc gac ggg acg gcg 6048
Pro Ala Thr Ala Pro Asp Arg Thr Pro Thr Ala Ser Asp Gly Thr Ala
2005 2010 2015

ccg ctg ctg ctc tcc gcc aag agc gag agc gcc ctg cgc gcc cag gcg 6096
Pro Leu Leu Leu Ser Ala Lys Ser Glu Ser Ala Leu Arg Ala Gln Ala
2020 2025 2030

gcc cgg ctg cac tcc cac ctg gag cgc gac ccc gcg ctc cgg ctc acg 6144
Ala Arg Leu His Ser His Leu Glu Arg Asp Pro Ala Leu Arg Leu Thr
2035 2040 2045

gac gcc gcg tac acg ctg atg acg cac cgc acg gcc ttc gcc cac cgc 6192 
Asp Ala Ala Tyr Thr Leu Met Thr His Arg Thr Ala Phe Ala His Arg 
2050 2055 2060 

gcg gcc gtc cgc gcc gcc gac cac gaa gcc gcg ctg cgc gcc ctg acc 6240 
Ala Ala Val Arg Ala Ala Asp His Glu Ala Ala Leu Arg Ala Leu Thr 
2065 2070 2075 2080 

gcc ctg gct gcg ggc gag gcc gac ccc gcc gtg gac acc ggc acc gcc 6288
Ala Leu Ala Ala Gly Glu Ala Asp Pro Ala Val Asp Thr Gly Thr Ala
2085 2090 2095

cac acc ggc cgg gac gcc gtc ctc ttc tcc ggc cag gga tcg caa cgc 6336
His Thr Gly Arg Asp Ala Val Leu Phe Ser Gly Gln Gly Ser Gln Arg
2100 2105 2110

atc gga atg ggc cgg gag ttg tcc ggc cgc tac ccg gtg ttc gca gag 6384
Ile Gly Met Gly Arg Glu Leu Ser Gly Arg Tyr Pro Val Phe Ala Glu
2115 2120 2125

gcc ttc gac acc gtg tgc gcg gcc ttg gac gag cat ctg gac cgc ccc 6432
Ala Phe Asp Thr Val Cys Ala Ala Leu Asp Glu His Leu Asp Arg Pro
2130 2135 2140

ctg cgg gac gtg gtc cgg ggc gag gac gag gag ctg ctg aac cgg acc 6480
Leu Arg Asp Val Val Arg Gly Glu Asp Glu Glu Leu Leu Asn Arg Thr
2145 2150 2155 2160

gtc tac gcc cag gcg ggg ctg ttc gcc atc gag gtg gcc ctc ttc cgg 6528
Val Tyr Ala Gln Ala Gly Leu Phe Ala Ile Glu Val Ala Leu Phe Arg
2165 2170 2175

ctc gtg gag tcc tgg ggc gta cgg ccg cac tac gtg gcc ggg cat tcc 6576
Leu Val Glu Ser Trp Gly Val Arg Pro His Tyr Val Ala Gly His Ser
2180 2185 2190

gtc ggc gag atc gcc gcc gcg cac gtc gcc ggg gtg ttc tcg ctg gcc 6624
Val Gly Glu Ile Ala Ala Ala His Val Ala Gly Val Phe Ser Leu Ala
2195 2200 2205

gat gcc tgc gcg ctg gtg gcg gca cgc gga cgg ctg atg cag gcg ctg 6672
Asp Ala Cys Ala Leu Val Ala Ala Arg Gly Arg Leu Met Gln Ala Leu
2210 2215 2220

ccc gcc ggc ggc gcg atg gcg gcg atc cgg gcg acg gag gac gaa gtc 6720
Pro Ala Gly Gly Ala Met Ala Ala Ile Arg Ala Thr Glu Asp Glu Val
2225 2230 2235 2240

ctc ccg cac ctg gcg gac agc gtc tcg atc gcg gcc gtc aac ggc ccg 6768
Leu Pro His Leu Ala Asp Ser Val Ser Ile Ala Ala Val Asn Gly Pro
2245 2250 2255



tcg tcg gtc gtc gtc tcc ggc gcc gag cac gcc gtg ctc tcc atc gcc 6816
Ser Ser Val Val Val Ser Gly Ala Glu His Ala Val Leu Ser Ile Ala
2260 2265 2270

gcg cac ttc gag ggc gcg ggc cgc aag acc acc agg ctg cgg gtc tcg 6864
Ala His Phe Glu Gly Ala Gly Arg Lys Thr Thr Arg Leu Arg Val Ser
2275 2280 2285

cac gcc ttc cac tcc ccg ctc atg gac ccg atg ctg gcc gac ttc cgc 6912
His Ala Phe His Ser Pro Leu Met Asp Pro Met Leu Ala Asp Phe Arg
2290 2295 2300

gcc gtc gcc gag ggc ctg acc tac ggc gag ccg gag ctg gcc gtc gta 6960 
Ala Val Ala Glu Gly Leu Thr Tyr Gly Glu Pro Glu Leu Ala Val Val 
2305 2310 2315 2320

tcg aac gtc acc ggc caa ctc gcc acc ccg gac cag ctg cgc acc ccc 7008
Ser Asn Val Thr Gly Gln Leu Ala Thr Pro Asp Gln Leu Arg Thr Pro
2325 2330 2335

gag tac tgg gtg acc cat gtc cgc gcg gcg gtg cgc ttc gcg gac ggg 7056
Glu Tyr Trp Val Thr His Val Arg Ala Ala Val Arg Phe Ala Asp Gly
2340 2345 2350

ata cgg gct ctg ggg gcg gaa ggg gtg acg cgg ttc ctc gaa ctc ggc 7104
Ile Arg Ala Leu Gly Ala Glu Gly Val Thr Arg Phe Leu Glu Leu Gly
2355 2360 2365

ccg gac ggc gtc ctg tcg gcc ttg gcc agg gag tcg gca ccg gac gac 7152
Pro Asp Gly Val Leu Ser Ala Leu Ala Arg Glu Ser Ala Pro Asp Asp
2370 2375 2380

gcc gtg tgc act ccc gtg ctg cgc aag gac cgc tcc gag gcg gcg acc 7200
Ala Val Cys Thr Pro Val Leu Arg Lys Asp Arg Ser Glu Ala Ala Thr
2385 2390 2395 2400

ctc ctc gcg gcc ctg acg cac ctg cac gta cac gga acc gag atc gac 7248
Leu Leu Ala Ala Leu Thr His Leu His Val His Gly Thr Glu Ile Asp
2405 2410 2415

tgg acc gcg ttc ctc gcc ggc cgc gac gcg cac gcc gtc gac ctg ccc 7296
Trp Thr Ala Phe Leu Ala Gly Arg Asp Ala His Ala Val Asp Leu Pro
2420 2425 2430

acg tac gcc ttc cag cac cag cgg ttc tgg ccg acc ccc gac cac acc 7344
Thr Tyr Ala Phe Gln His Gln Arg Phe Trp Pro Thr Pro Asp His Thr
2435 2440 2445

cgc acc ggt gac ctg ggc gcc gtc ggc ctc gaa gcg acc ggg cac ccg 7392
Arg Thr Gly Asp Leu Gly Ala Val Gly Leu Glu Ala Thr Gly His Pro
2450 2455 2460

ctg ctg agc gcc gcc gtg gaa ctg ccg gac ggt gag ggc ctg ttg ttc 7440
Leu Leu Ser Ala Ala Val Glu Leu Pro Asp Gly Glu Gly Leu Leu Phe
2465 2470 2475 2480

acc acc cgc ctc tcg ctc cag acc cac ccc tgg ctg gcc ggg cac gtc 7488
Thr Thr Arg Leu Ser Leu Gln Thr His Pro Trp Leu Ala Gly His Val
2485 2490 2495

gtc atg ggc tcg gtc ctg ctg ccg ggg acg gcc ttc gcc gaa ctc gcc 7536
Val Met Gly Ser Val Leu Leu Pro Gly Thr Ala Phe Ala Glu Leu Ala
2500 2505 2510

ctc cgc gcc gcc gac gag gtg ggc tgc gac cgc gtc gac gaa ctg acc 7584
Leu Arg Ala Ala Asp Glu Val Gly Cys Asp Arg Val Asp Glu Leu Thr
2515 2520 2525

ctg gcc gcc ccg ctc gtc ctg ccc gag cac ggc ggc gta cag ctc cag 7632
Leu Ala Ala Pro Leu Val Leu Pro Glu His Gly Gly Val Gln Leu Gln
2530 2535 2540

ctg cgg gtg ggc ccc gcc gac gcg tcc ggc cgc cgc acc ctg acc gcc 7680
Leu Arg Val Gly Pro Ala Asp Ala Ser Gly Arg Arg Thr Leu Thr Ala
2545 2550 2555 2560






cgc tcc agg gcg gag ggc gac ggc gac cgc ccg tgg gtc cag cac gcc 7728
Arg Ser Arg Ala Glu Gly Asp Gly Asp Arg Pro Trp Val Gln His Ala
2565 2570 2575

acc ggc gtc ctc gcg gaa ggg gag tcg acg ccc gaa ccc ggc tac gac 7776
Thr Gly Val Leu Ala Glu Gly Glu Ser Thr Pro Glu Pro Gly Tyr Asp
2580 2585 2590

ttc cac acc gag tcc tgg ccg ccc gcc gac gcc gcg ccc gtc gaa ctg 7824
Phe His Thr Glu Ser Trp Pro Pro Ala Asp Ala Ala Pro Val Glu Leu
2595 2600 2605

tcc ggc ctc tac ccg gac ttc gcc gca cac ggt ttc gac tac ggt ccc 7872
Ser Gly Leu Tyr Pro Asp Phe Ala Ala His Gly Phe Asp Tyr Gly Pro
2610 2615 2620

cac ttc cag ggg ctg cgg acc gcc tgg cgc cga ggc gac gag gtg ttc 7920
His Phe Gln Gly Leu Arg Thr Ala Trp Arg Arg Gly Asp Glu Val Phe
2625 2630 2635 2640

gcc gag gtc gcc ctg ccc gcc gag gcc gaa ggc gag gca tcc gcg tac 7968
Ala Glu Val Ala Leu Pro Ala Glu Ala Glu Gly Glu Ala Ser Ala Tyr
2645 2650 2655

gga ctc cat ccg gcg ctg ctc gac gcc gcc ctg cac gtc gtc gcg ttc 8016
Gly Leu His Pro Ala Leu Leu Asp Ala Ala Leu His Val Val Ala Phe
2660 2665 2670

aac gga gtg gac cgc ggc gtc gtg ccg ttc tcc tgg gag agc gtc gcg 8064
Asn Gly Val Asp Arg Gly Val Val Pro Phe Ser Trp Glu Ser Val Ala
2675 2680 2685

ctg cac gcc acc ggc gcc tcg gcc gta cgg atc cgg gtc gtc cgg cac 8112
Leu His Ala Thr Gly Ala Ser Ala Val Arg Ile Arg Val Val Arg His
2690 2695 2700

agc ggc gac acg gtc tcc gtg gat gtc gcc gac acc acc ggc gag ccc 8160
Ser Gly Asp Thr Val Ser Val Asp Val Ala Asp Thr Thr Gly Glu Pro
2705 2710 2715 2720

gtc gcc tcc atc ggc acg ctc gtc ctg cgg gcg gtc tcc gcc gac cag 8208
Val Ala Ser Ile Gly Thr Leu Val Leu Arg Ala Val Ser Ala Asp Gln
2725 2730 2735

ttg gcg ggc ggc gcg gac ccg gcc gtc cgc gat gcg ctg ttc cgc gtg 8256
Leu Ala Gly Gly Ala Asp Pro Ala Val Arg Asp Ala Leu Phe Arg Val
2740 2745 2750

cag tgg aac ccc gta cgc ctg ccc ccg gcc ggg gcc gcg gtg acc gtg 8304
Gln Trp Asn Pro Val Arg Leu Pro Pro Ala Gly Ala Ala Val Thr Val
2755 2760 2765

gcg acg ctc ggc tcc ctt gcc ggc gca ccg ttc gac ggc tac ccg gac 8352
Ala Thr Leu Gly Ser Leu Ala Gly Ala Pro Phe Asp Gly Tyr Pro Asp
2770 2775 2780

ctg gcg tcc ctg gcc cgg tcc ggt cgt gtg gcg ggt gcg gtg ctg gta 8400
Leu Ala Ser Leu Ala Arg Ser Gly Arg Val Ala Gly Ala Val Leu Val
2785 2790 2795 2800

ccg gtg gaa gcc ggt gcc ggc gag gtg gtg gcg gac gat gtc gtg ggg 8448
Pro Val Glu Ala Gly Ala Gly Glu Val Val Ala Asp Asp Val Val Gly
2805 2810 2815

gcg acg cac gca acg gcc gcc cgg gcg ctg gac ctg gcc cgg tcg tgg 8496
Ala Thr His Ala Thr Ala Ala Arg Ala Leu Asp Leu Ala Arg Ser Trp
2820 2825 2830

ctg gcc gat gac cgg ttc gcg gcc tcg cgc ctg gtg ttc gtg acg cgt 8544
Leu Ala Asp Asp Arg Phe Ala Ala Ser Arg Leu Val Phe Val Thr Arg
2835 2840 2845

ggc gcg gtg tcc ggt gcg gat ctc gcg ggt gcg gcg gtg tgg ggt ctg 8592
Gly Ala Val Ser Gly Ala Asp Leu Ala Gly Ala Ala Val Trp Gly Leu
2850 2855 2860

gtg cgg tcg gcg ctg tcg gag cac ccg ggc cgc ttc ggt ctg gtg gat 8640
Val Arg Ser Ala Leu Ser Glu His Pro Gly Arg Phe Gly Leu Val Asp
2865 2870 2875 2880

ctg gat gac gat gcc gaa ctg gcg ctg gtg cca cgg gtg ttg gcg tcg 8688
Leu Asp Asp Asp Ala Glu Leu Ala Leu Val Pro Arg Val Leu Ala Ser
2885 2890 2895

gat gag ccg cag ctg ctg gtg cgc ggt ggt gag gtg ctg gcg gcg cgg 8736
Asp Glu Pro Gln Leu Leu Val Arg Gly Gly Glu Val Leu Ala Ala Arg
2900 2905 2910

ctg gcc cgg gcg cag tcc tcg cac gcg gtg acc tgg gat ccg tcc ggc 8784
Leu Ala Arg Ala Gln Ser Ser His Ala Val Thr Trp Asp Pro Ser Gly
2915 2920 2925

acg gtg ctc gtc acc ggt ggc acg ggt ggt ctg ggc cgt gtg atg gca 8832
Thr Val Leu Val Thr Gly Gly Thr Gly Gly Leu Gly Arg Val Met Ala
2930 2935 2940

cgt cac ttg gtg gtg gaa cac ggg gta cgg aac ctg ctg ctg gtc agc 8880
Arg His Leu Val Val Glu His Gly Val Arg Asn Leu Leu Leu Val Ser
2945 2950 2955 2960

cgc cgt ggg ccc gcc gcc gaa ggt gcc gaa gag ctg gtg acg gag ctc 8928
Arg Arg Gly Pro Ala Ala Glu Gly Ala Glu Glu Leu Val Thr Glu Leu
2965 2970 2975

cgg cac agc ggt gcc gaa gtg gcc gtc gaa gcc tgt gat gtc acc gac 8976
Arg His Ser Gly Ala Glu Val Ala Val Glu Ala Cys Asp Val Thr Asp
2980 2985 2990

gcg gcc gcc gtg gcc gac ctg gtg gcc cgg cac cgg atc agc gct gtg 9024
Ala Ala Ala Val Ala Asp Leu Val Ala Arg His Arg Ile Ser Ala Val
2995 3000 3005

gtg cat acg gcc ggt gtc ctg gat gac ggt gtg gtg gag tcg ctg aca 9072
Val His Thr Ala Gly Val Leu Asp Asp Gly Val Val Glu Ser Leu Thr
3010 3015 3020

ccg gag cgg ctg tcg gcg gtg ttg cgt ccg aag gtg gat gcg gcc tgg 9120
Pro Glu Arg Leu Ser Ala Val Leu Arg Pro Lys Val Asp Ala Ala Trp
3025 3030 3035 3040

aac ctg cac gag gcg acc agg gat ctg gac ctg gac gcg ttc gtg gtc 9168
Asn Leu His Glu Ala Thr Arg Asp Leu Asp Leu Asp Ala Phe Val Val
3045 3050 3055

ttc tcc tca gtg gca ggc acg atc ggg agc ccc ggt cag gcc aac tac 9216
Phe Ser Ser Val Ala Gly Thr Ile Gly Ser Pro Gly Gln Ala Asn Tyr
3060 3065 3070

gcg gcg ggc aac gcc ttc ctg gat gcc ctg gcc cac cac cgt cgg gcg 9264
Ala Ala Gly Asn Ala Phe Leu Asp Ala Leu Ala His His Arg Arg Ala
3075 3080 3085

gcg ggt ctt ccg gcg gcg tcg ctg gca tgg ggc ccc tgg tcc cgg gac 9312
Ala Gly Leu Pro Ala Ala Ser Leu Ala Trp Gly Pro Trp Ser Arg Asp
3090 3095 3100

ggc ggc atg acc ggc acc ctg acc gac gtc gac tcc agc gca tcg ccc 9360
Gly Gly Met Thr Gly Thr Leu Thr Asp Val Asp Ser Ser Ala Ser Pro
3105 3110 3115 3120

ggc agg cat gcc cga act cac ccc cgc aca ggg cgt ggc ctc ttc gac 9408
Gly Arg His Ala Arg Thr His Pro Arg Thr Gly Arg Gly Leu Phe Asp
3125 3130 3135

gcc gcg ctg gcg gcc ggt gac gcc cac ctg ctc ccc gta cgc ttc gac 9456
Ala Ala Leu Ala Ala Gly Asp Ala His Leu Leu Pro Val Arg Phe Asp
3140 3145 3150

tgg gcg tcc ctg cgc gcc cag ggc gag gtg cca ccg ctg ttg cgc ggc 9504
Trp Ala Ser Leu Arg Ala Gln Gly Glu Val Pro Pro Leu Leu Arg Gly
3155 3160 3165

ctg atc agg acc cgt gcc cgg cgc tcg gcg gtc ggc ggc tcg gcc gcg 9552
Leu Ile Arg Thr Arg Ala Arg Arg Ser Ala Val Gly Gly Ser Ala Ala
3170 3175 3180

gca gcc ggc ctg gtg gga cgc ctg agc gga cgg gga acg gtg gag cgg 9600
Ala Ala Gly Leu Val Gly Arg Leu Ser Gly Arg Gly Thr Val Glu Arg
3185 3190 3195 3200

cgc gag gtg ctc ctg gac ctg gta cgg gcc cag atc gcg gtc gtc ctg 9648
Arg Glu Val Leu Leu Asp Leu Val Arg Ala Gln Ile Ala Val Val Leu
3205 3210 3215



ggc cac gcg aac ccg gag acg atc gag tcc acc cgt gtc ttc cag gac 9696
Gly His Ala Asn Pro Glu Thr Ile Glu Ser Thr Arg Val Phe Gln Asp
3220 3225 3230

ctc ggc ttc gac tcc ctg acc gcg gtc gaa ctc cgc aac cgc ctc aac 9744
Leu Gly Phe Asp Ser Leu Thr Ala Val Glu Leu Arg Asn Arg Leu Asn
3235 3240 3245

aac gcg acc ggc ctg cgc ctt tcg gcc acc gcc gtc ttc gac tac ccc 9792
Asn Ala Thr Gly Leu Arg Leu Ser Ala Thr Ala Val Phe Asp Tyr Pro
3250 3255 3260

acg gcg gac gcg ctc gtc gac ttc ctg ctg gac gag ctg ttc ggc gcg 9840
Thr Ala Asp Ala Leu Val Asp Phe Leu Leu Asp Glu Leu Phe Gly Ala
3265 3270 3275 3280

cag gag gag gcc gag ctg ccg gcg ccg gtg ccg tca ccg gcg ggg gcc 9888
Gln Glu Glu Ala Glu Leu Pro Ala Pro Val Pro Ser Pro Ala Gly Ala
3285 3290 3295

gcc gac gac ccg gtc gtg atc gtc ggc atg agc tgc cgc tac ccg ggc 9936
Ala Asp Asp Pro Val Val Ile Val Gly Met Ser Cys Arg Tyr Pro Gly
3300 3305 3310

ggc gtc ggc tcg ccc gag gac ctg tgg cgc ctg gtg tcg gag ggc gtg 9984
Gly Val Gly Ser Pro Glu Asp Leu Trp Arg Leu Val Ser Glu Gly Val
3315 3320 3325

gac gcg gtg tcc gac ttc ccc acc gac cgt gga tgg gac gtg gag agc 10032
Asp Ala Val Ser Asp Phe Pro Thr Asp Arg Gly Trp Asp Val Glu Ser
3330 3335 3340

ctc tac agc ccc gac ccc gag gcg ctc ggc acc tcg tac acc cgc tcc 10080
Leu Tyr Ser Pro Asp Pro Glu Ala Leu Gly Thr Ser Tyr Thr Arg Ser
3345 3350 3355 3360

ggt gga ttc ctc cac gag gcg gcg gag ttc gac ccc gat ttc ttc ggg 10128
Gly Gly Phe Leu His Glu Ala Ala Glu Phe Asp Pro Asp Phe Phe Gly
3365 3370 3375

atg agc ccg cgc gag gcg ctg gcg acc gac gcc cag cag cgg ctg ctg 10176
Met Ser Pro Arg Glu Ala Leu Ala Thr Asp Ala Gln Gln Arg Leu Leu
3380 3385 3390

ctg gag acg acc tgg gag gcc atc gag cgc acg ggc atc gac ccg gcg 10224
Leu Glu Thr Thr Trp Glu Ala Ile Glu Arg Thr Gly Ile Asp Pro Ala
3395 3400 3405

tcg ctg cgg ggc agc cgt acg ggc gtc ttc gcg ggc gtg atg tac acc 10272
Ser Leu Arg Gly Ser Arg Thr Gly Val Phe Ala Gly Val Met Tyr Thr
3410 3415 3420

gac tac ggc gac ctc ctc gtc ggc gac cag ttc gag ggc tac cgc agc 10320
Asp Tyr Gly Asp Leu Leu Val Gly Asp Gln Phe Glu Gly Tyr Arg Ser
3425 3430 3435 3440

aac ggc agc gcg gcc agc atc gcc tcc ggc cgg gtc tcg tac acc ttc 10368
Asn Gly Ser Ala Ala Ser Ile Ala Ser Gly Arg Val Ser Tyr Thr Phe
3445 3450 3455

ggt ttc gag ggt ccg gcg gtc acg gtg gac acg gca tgc tcg tcg tcc 10416
Gly Phe Glu Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser Ser
3460 3465 3470

ctg gtc gcc ctg cac tgg gcg gcg cag tcg ctg cgc tcg ggc gag tgc 10464
Leu Val Ala Leu His Trp Ala Ala Gln Ser Leu Arg Ser Gly Glu Cys
3475 3480 3485

tcg ctc gcg gtc gcg ggc ggt gtg acg gtg atg tcc aca ccg acg acg 10512
Ser Leu Ala Val Ala Gly Gly Val Thr Val Met Ser Thr Pro Thr Thr
3490 3495 3500

ttc gtc gag ttc tcg cgg caa cgc gga ctg tcg gcg gac ggc cgc tgc 10560
Phe Val Glu Phe Ser Arg Gln Arg Gly Leu Ser Ala Asp Gly Arg Cys
3505 3510 3515 3520

aag gcg ttc gcc gat gcg gcc gac ggc gtc ggc tgg ggc gag ggc gtc 10608
Lys Ala Phe Ala Asp Ala Ala Asp Gly Val Gly Trp Gly Glu Gly Val
3525 3530 3535

ggc atg ctc gta ctg gag cgt ctg tcg gac gcg cgc cgc aac ggg cac 10656
Gly Met Leu Val Leu Glu Arg Leu Ser Asp Ala Arg Arg Asn Gly His
3540 3545 3550

cgg gtg ctc gcg gtg gtg cgc ggc agt gcg gtg aac cag gac ggt gcg 10704
Arg Val Leu Ala Val Val Arg Gly Ser Ala Val Asn Gln Asp Gly Ala
3555 3560 3565



tcc aat ggt ctg acg gcg ccg aac ggc ccc gcc cag cag cgg gtg atc 10752
Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ala Gln Gln Arg Val Ile
3570 3575 3580

cgg cag gcg ctg gcg agt gcg ggg ctg tcg gcg gcg gat gtg gac gcg 10800
Arg Gln Ala Leu Ala Ser Ala Gly Leu Ser Ala Ala Asp Val Asp Ala
3585 3590 3595 3600

gtg gag gcg cac ggt acg ggt acg acg ctg ggc gat ccg atc gag gcc 10848
Val Glu Ala His Gly Thr Gly Thr Thr Leu Gly Asp Pro Ile Glu Ala
3605 3610 3615

cag gcg ctg ctc gcc acg tat ggc cag gag cga cct gag gac cgg ccg 10896
Gln Ala Leu Leu Ala Thr Tyr Gly Gln Glu Arg Pro Glu Asp Arg Pro
3620 3625 3630

ttg ctg ctg ggg tcg gtc aaa tcc aac atc ggt cat gcg cag gcg gct 10944
Leu Leu Leu Gly Ser Val Lys Ser Asn Ile Gly His Ala Gln Ala Ala
3635 3640 3645

tcg ggt gtg gcg ggt gtc atc aag atg gtg ctg gcg atg cgg cac ggt 10992
Ser Gly Val Ala Gly Val Ile Lys Met Val Leu Ala Met Arg His Gly
3650 3655 3660

gtg ctg cct cgg acg ctg cat gtg gat gaa ccg tcg tcg cat gtc gac 11040
Val Leu Pro Arg Thr Leu His Val Asp Glu Pro Ser Ser His Val Asp
3665 3670 3675 3680

tgg agt gcc ggt gcc gtc gag ctg ctg acc tcc gag gcc gag tgg ccg 11088
Trp Ser Ala Gly Ala Val Glu Leu Leu Thr Ser Glu Ala Glu Trp Pro
3685 3690 3695

cag ggc gag ggg ccg cgc cgc gcg ggc gtc tcc tcc ttc ggc gtc agc 11136
Gln Gly Glu Gly Pro Arg Arg Ala Gly Val Ser Ser Phe Gly Val Ser
3700 3705 3710

ggg acg aac gcg cat gtg atc ctg gag cag ccc gga ccg gac gcg gcc 11184
Gly Thr Asn Ala His Val Ile Leu Glu Gln Pro Gly Pro Asp Ala Ala
3715 3720 3725

gac gcc gca ccg gac gcc acg gtg acc gat ccc ggc gcg ctg gca tgg 11232
Asp Ala Ala Pro Asp Ala Thr Val Thr Asp Pro Gly Ala Leu Ala Trp
3730 3735 3740

gtg ctc tcc gca cgg aac gaa gcg gcc ctg cgc tgc cag gcg gcg cgc 11280
Val Leu Ser Ala Arg Asn Glu Ala Ala Leu Arg Cys Gln Ala Ala Arg
3745 3750 3755 3760

ctg ctg tcc ctg gtc gcc ggc agt gac gcg ctg tgc gcg cgg gac atc 11328
Leu Leu Ser Leu Val Ala Gly Ser Asp Ala Leu Cys Ala Arg Asp Ile
3765 3770 3775

ggc cac tcg ctg gtg acc ggg cgg tcg agc ttc gcc cac cgt gcg gtg 11376
Gly His Ser Leu Val Thr Gly Arg Ser Ser Phe Ala His Arg Ala Val
3780 3785 3790

gtg tgg ggc cag gac cgc gac gca ctg gtg cgt gcc ctg tcc gca ctc 11424
Val Trp Gly Gln Asp Arg Asp Ala Leu Val Arg Ala Leu Ser Ala Leu
3795 3800 3805

gcg gtg ggc gag gcc gac gcc ggt ctg gcg gag ggc gcg tcc ggc gcg 11472
Ala Val Gly Glu Ala Asp Ala Gly Leu Ala Glu Gly Ala Ser Gly Ala
3810 3815 3820

ggg agg acg gcc ttc ctg ttc tcg ggc cag gga tca caa cgg ctg gga 11520
Gly Arg Thr Ala Phe Leu Phe Ser Gly Gln Gly Ser Gln Arg Leu Gly
3825 3830 3835 3840

atg gga tgg gag ttg tac gct cgc tac ccg gtg ttc gcg gac gca ttc 11568
Met Gly Trp Glu Leu Tyr Ala Arg Tyr Pro Val Phe Ala Asp Ala Phe
3845 3850 3855






gac gcc gtg tgc gcg gcc ttg gac gag cac ctg gag cgc ccc ctg cgg 11616
Asp Ala Val Cys Ala Ala Leu Asp Glu His Leu Glu Arg Pro Leu Arg
3860 3865 3870

... gcg gag ctg ctg aac cag acc gcg tac 11664
Asp Val Val Trp Gly Glu Asp Ala Glu Leu Leu Asn Gln Thr Ala Tyr
3875 3880 3885

... gag gtg gcg ctg tac cgg ctg gcg 11712
Ala Gln Ala Gly Leu Phe Ala Ile Glu Val Ala Leu Tyr Arg Leu Ala
3890 3895 3900

gaa tcg tgg ggc atg cgc ccg gac ttc gtg gcg ggg cat tcg atc ggt 11760
Glu Ser Trp Gly Met Arg Pro Asp Phe Val Ala Gly His Ser Ile Gly
3905 3910 3915 3920

gag gtc gcc gcg gcc cat gtg tcg ggt gtc ttc tcg ctc ccg gat gcc 11808
Glu Val Ala Ala Ala His Val Ser Gly Val Phe Ser Leu Pro Asp Ala
3925 3930 3935

tgt gcg ctg gtg gcg gcc cga ggc cga ctg atg cag caa ctg ccc tcc 11856
Cys Ala Leu Val Ala Ala Arg Gly Arg Leu Met Gln Gln Leu Pro Ser
3940 3945 3950

ggc ggc gcg atg atg gcg atc cgg gcg acc gag gac gag gtc ctt ccg 11904
Gly Gly Ala Met Met Ala Ile Arg Ala Thr Glu Asp Glu Val Leu Pro
3955 3960 3965

cat ctg gcg gaa ggc gtc tcg ctc gcg gcg gtc aat ggc ccg tcg tcg 11952
His Leu Ala Glu Gly Val Ser Leu Ala Ala Val Asn Gly Pro Ser Ser
3970 3975 3980

gtc gtg atc tcg ggc gcc gag gac gcg gtg ctg gcc atc gcg gcg cac 12000
Val Val Ile Ser Gly Ala Glu Asp Ala Val Leu Ala Ile Ala Ala His
3985 3990 3995 4000

ttc gcg ggg gag ggg cgc aaa acc acc cga ctg cgg gtc tcg cat gcc 12048
Phe Ala Gly Glu Gly Arg Lys Thr Thr Arg Leu Arg Val Ser His Ala
4005 4010 4015

ttc cac tcg ccg ctc atg gaa ccg atg ctg gag gaa ttc cgc gcg gtg 12096
Phe His Ser Pro Leu Met Glu Pro Met Leu Glu Glu Phe Arg Ala Val
4020 4025 4030

gtg aca cgg ctg tcc ttc ggc acg ccg acg atc ccc gtc gtc tcc aac 12144
Val Thr Arg Leu Ser Phe Gly Thr Pro Thr Ile Pro Val Val Ser Asn
4035 4040 4045

ctg acg ggc cgc ctc gcc gaa ccc gaa cag ctc gcg cac gcc gac tac 12192
Leu Thr Gly Arg Leu Ala Glu Pro Glu Gln Leu Ala His Ala Asp Tyr
4050 4055 4060

tgg gtc cgg cac gtc cgc gag gca gtg cgc ttc gcg gac ggg ata cag 12240
Trp Val Arg His Val Arg Glu Ala Val Arg Phe Ala Asp Gly Ile Gln
4065 4070 4075 4080



gcn ctg cgn gcg gaa ggg gtg acg cgg ttc ctg gag ctc ggc ccg gac 12288
Ala Leu Arg Ala Glu Gly Val Thr Arg Phe Leu Glu Leu Gly Pro Asp
4085 4090 4095 

ggt gtg ctg tcg gcg atg gcc cgc gag tcg gca tcg gac gac gcc gtg 12336
Gly Val Leu Ser Ala Met Ala Arg Glu Ser Ala Ser Asp Asp Ala Val
4100 4105 4110

ctc gcg ccc gta ctg cgc agg gac cgg ccc gag gag acg gcg ctg ctg 12384
Leu Ala Pro Val Leu Arg Arg Asp Arg Pro Glu Glu Thr Ala Leu Leu
4115 4120 4125 

ggc gcc ctg gcg cag ctg tac gtc egg ggt gcg cac gtg gac tgg acg 12432 
Gly Ala Leu Ala Gin Leu Tyr Val Arg Gly Ala His Val Asp Trp Thr 
4130 4135 4140 

gtg ccg ttc gcc ggt teg ggt gcg cgc tgg gcg gat ctg ccg acg tac 12480 
Val Pro Phe Ala Gly Ser Gly Ala Arg Trp Ala Asp Leu Pro Thr Tyr 
4145 4150 4155 4160 

gcg ttc cag cac gag egg ttc tgg ccg teg ggc ggt gtg gea cgt ccg 12528 
Ala Phe Gin His Glu Arg Phe Trp Pro Ser Gly Gly Val Ala Arg Pro 
4165 4170 4175 

ggc gat gtg egg tec gcg ggc ctg ggc teg gcc ggg eat ccg ctg ctg 12576 
Gly Asp Val Arg Ser Ala Gly Leu Gly Ser Ala Gly His Pro Leu Leu 
4180 4185 4190 

ggc gcg gcg gtg gaa ctg gcg ggc teg ggc ggc ctg ttg ttc acg ggc 12624 
Gly Ala Ala Val Glu Leu Ala Gly Ser Gly Gly Leu Leu Phe Thr Gly 
4195 4200 4205 

egg ctg teg gtg tec teg cac ccg tgg ctg gcg gac cat gtg gtg ctg 12672 
Arg Leu Ser Val Ser Ser His Pro Trp Leu Ala Asp His Val Val Leu 
4210 4215 4220 

ggc tec gtc etc gtg ccc ggc acc gcg ctg gtg gaa ctg gtg ctg egg 12720 
Gly Ser Val Leu Val Pro Gly Thr Ala Leu Val Glu Leu Val Leu Arg 
4225 4230 4235 4240 

gcg gee gac gag gee ggc tgt gac etc ctg gag gag ctg acg etc gee 12768 
Ala Ala Asp Glu Ala Gly Cys Asp Leu Leu Glu Glu Leu Thr Leu Ala 
4245 4250 4255 

gca ccg ctg gtg ctg ccc gcc teg ggc gcc gcg gtc cag gtt cag gta 12816 
Ala Pro Leu Val Leu Pro Ala Ser Gly Ala Ala Val Gin Val Gin Val 
4260 4265 4270 

gcg gtg ggc gag ccc gat gag gcg ggc cgc cgg ccg gtc tcg gtc cat 12864
Ala Val Gly Glu Pro Asp Glu Ala Gly Arg Arg Pro Val Ser Val His 
4275 4280 4285 

gca cgt gag ggc gag ggc eca tgg acg ctg cac gcc agt ggt gcg gtg 12912 
Ala Arg Glu Gly Glu Gly Pro Trp Thr Leu His Ala Ser Gly Ala Val 
4290 4295 4300 

acc teg ggc gee gaa gtg ccc ccc ttc gac gcc acc gta tgg ccg ccc 12960 
Thr Ser Gly Ala Glu Val Pro Pro Phe Asp Ala Thr Val Trp Pro Pro 
4305 4310 4315 4320 

aag ggc gcg gag ccc gtg gac gtg gcg gac tgc tac gac gta ctc gcc 13008
Lys Gly Ala Glu Pro Val Asp Val Ala Asp Cys Tyr Asp Val Leu Ala 
4325 4330 4335 

gat gcc ggg ctc acc tac ggc ccg gcc ttc cac ggc ctg caa gcg gcc 13056
Asp Ala Gly Leu Thr Tyr Gly Pro Ala Phe His Gly Leu Gln Ala Ala
4340 4345 4350 

tgg aag etc ggt ggg gac gtc tac gcc gag gcg aag etc ccc gag age 13104 
Trp Lys Leu Gly Gly Asp Val Tyr Ala Glu Ala Lys Leu Pro Glu Ser 
4355 4360 4365 

acc gac ggc gac gca tac ggt ctg cac ccc gcg ctc ttc gac gcc gcg 13152
Thr Asp Gly Asp Ala Tyr Gly Leu His Pro Ala Leu Phe Asp Ala Ala
4370 4375 4380



ctg cac gcg teg gcg ctg ggc ggc gcg gaa gcg ggc gga gtc ccg ttc 13200 
Leu His Ala Ser Ala Leu Gly Gly Ala Glu Ala Gly Gly Val Pro Phe 
4385 4390 4395 4400 

tcc tgg gcc gga gtg tcg ctg cac gcg acc ggc gcc tcg cac ctc cgc 13248
Ser Trp Ala Gly Val Ser Leu His Ala Thr Gly Ala Ser His Leu Arg
4405 4410 4415

gtc cgc atc cgc gaa gcg ggc ggc gcg ctg tcg gtc gcg atc gcg gac 13296
Val Arg Ile Arg Glu Ala Gly Gly Ala Leu Ser Val Ala Ile Ala Asp
4420 4425 4430

acg tcc ggc gcg ccg gtc gcc tcg gtg gag tcg ctg gtg ata cgt ccg 13344
Thr Ser Gly Ala Pro Val Ala Ser Val Glu Ser Leu Val Ile Arg Pro
4435 4440 4445

ctc tcg gcc ggg cag gtg cag gcc gcc gac cgt gac gcc ctc ttc aag 13392
Leu Ser Ala Gly Gln Val Gln Ala Ala Asp Arg Asp Ala Leu Phe Lys
4450 4455 4460

... gtc gag ccg ggc 13440
Ala Asp Trp Val Pro Val Pro Leu Thr Asp Glu Arg Val Glu Pro Gly
4465 4470 4475 4480 

ace ggc ccg gag ggc gag ccg ctg egg acg tac gcg gat etg gat tec 13488 
Thr Gly Pro Glu Gly Glu Pro Leu Arg Thr Tyr Ala Asp Leu Asp Ser 
4485 4490 4495 

Ctg gag ggc gcg gcc gtg cce ggg acg gtc ctg gtc gcg ccg cct tec 13536 
Leu Glu Gly Ala Ala Val Pro Gly Thr Val Leu Val Ala Pro Pro Ser 
4500 4505 4510 

ggc get gcc ggg acg gtg gag tec gta cac gcc gcg ace gtc tqq acq 13584 
Gly Ala Ala Gly Thr Val Glu Ser Val His Ala Ala Thr Val Trp Ala 
4515 4520 4525 

etg gag atg gtg cag gcg tgg ctg gcc gac gac egg tte gee aec teg 13632 
Leu Glu Met Val Gin Ala Trp Leu Ala Asp Asp Arg Phe Ala Thr Ser 
4530 4535 4540 

cga ctg gtg ttc gtc acc cgc ggc gcg gcc ttc ggc gcg gat ctt gcg 13680
Arg Leu Val Phe Val Thr Arg Gly Ala Ala Phe Gly Ala Asp Leu Ala
4545 4550 4555 4560

gcg gcc gcc gtc cgg ggc ctg gtg cgc tcg gca cag tcg gag aac ccg 13728
Ala Ala Ala Val Arg Gly Leu Val Arg Ser Ala Gln Ser Glu Asn Pro
4565 4570 4575

ggc cgc ttc ggc ctg gtg gac atg gac ggc gac gcc gat acg acc gta 13776
Gly Arg Phe Gly Leu Val Asp Met Asp Gly Asp Ala Asp Thr Thr Val
4580 4585 4590

ccg gcg caa gcg ctc gcg acc gac gag ccc gaa ctg ctg gtg cgt ggt 13824
Pro Ala Gln Ala Leu Ala Thr Asp Glu Pro Glu Leu Leu Val Arg Gly
4595 4600 4605



ggt gag gtg ctg gcg gcc cgg ctg gtc cgg gcg cag tcc tcg cac acg 13872
Gly Glu Val Leu Ala Ala Arg Leu Val Arg Ala Gln Ser Ser His Thr
4610 4615 4620

gtg acg tgg gat ccg tcc ggt acg gtc ctg atc acc ggc ggg acc ggt 13920
Val Thr Trp Asp Pro Ser Gly Thr Val Leu Ile Thr Gly Gly Thr Gly
4625 4630 4635 4640

ggg ctg ggc cgt agt gtc gcc cgg cac ttg gtg agc gag cac ggg gtg 13968
Gly Leu Gly Arg Ser Val Ala Arg His Leu Val Ser Glu His Gly Val
4645 4650 4655

cgc agt ctg ctg ctg gtc agc cgc cgt ggt ccc gcg gcc gag ggt gcc 14016
Arg Ser Leu Leu Leu Val Ser Arg Arg Gly Pro Ala Ala Glu Gly Ala
4660 4665 4670



ggg gag ttg gtg gcc gaa ctc agg ggc agt ggc gcc gag gtg gtc atc 14064
Gly Glu Leu Val Ala Glu Leu Arg Gly Ser Gly Ala Glu Val Val Ile
4675 4680 4685

gag gct tgt gat gtg acc gat gcg gtg gcg gtg gcc gat ctg gtg gct 14112
Glu Ala Cys Asp Val Thr Asp Ala Val Ala Val Ala Asp Leu Val Ala
4690 4695 4700

cgg cat cgg atc agt gct gtg gtg cat acg gcc ggt gtt ctg gat gac 14160
Arg His Arg Ile Ser Ala Val Val His Thr Ala Gly Val Leu Asp Asp
4705 4710 4715 4720

ggt gtg gtg gag tcg ctg acg ccg gag cgg ctt gcg gtg gtg ttg cgt 14208
Gly Val Val Glu Ser Leu Thr Pro Glu Arg Leu Ala Val Val Leu Arg
4725 4730 4735

ccg aag gtg gat gcg gcc tgg aac ctg cac gag gcg acc agg ggt ctg 14256
Pro Lys Val Asp Ala Ala Trp Asn Leu His Glu Ala Thr Arg Gly Leu
4740 4745 4750



gat ctg gat gcg ttt gtg gtg ttc tcg tcc gtg gca ggc act ttc ggc 14304
Asp Leu Asp Ala Phe Val Val Phe Ser Ser Val Ala Gly Thr Phe Gly
4755 4760 4765

agt gcg ggt cag gcc aat tac gcg gcg ggt aat gct ttc ctg gac gcg 14352
Ser Ala Gly Gln Ala Asn Tyr Ala Ala Gly Asn Ala Phe Leu Asp Ala
4770 4775 4780

ctg gcg tat cac cgt cgg gcg gtg ggt ctg ccg gcg gtg tcg ctg gcg 14400
Leu Ala Tyr His Arg Arg Ala Val Gly Leu Pro Ala Val Ser Leu Ala
4785 4790 4795 4800

tgg ggc cct tgg tcg cag gac ggt ggt atg acc ggc acc ttg agc gac 14448
Trp Gly Pro Trp Ser Gln Asp Gly Gly Met Thr Gly Thr Leu Ser Asp
4805 4810 4815

gcc gat gtc cag cgc atc gcc cgg cag ggc atg ccg ccg ctg acc gtc 14496
Ala Asp Val Gln Arg Ile Ala Arg Gln Gly Met Pro Pro Leu Thr Val
4820 4825 4830

gag gag ggt ctg gcc ctc ttc gac gcc gcg ctc ggc agc gcc gaa ccc 14544
Glu Glu Gly Leu Ala Leu Phe Asp Ala Ala Leu Gly Ser Ala Glu Pro
4835 4840 4845

atg gca ctc ccg gtc cgc ctg gac ctg gcg gcg ctg cgg gca caa ggc 14592
Met Ala Leu Pro Val Arg Leu Asp Leu Ala Ala Leu Arg Ala Gln Gly
4850 4855 4860

gag ccc cag cca ctg ctg cgc ggc ctc atc cgg acg agg acc cgc cgg 14640
Glu Pro Gln Pro Leu Leu Arg Gly Leu Ile Arg Thr Arg Thr Arg Arg
4865 4870 4875 4880

tcc ggc gcc gcc gcg gca tcc ggc atc gcg cag cgc ctt gcc ggg ctg 14688
Ser Gly Ala Ala Ala Ala Ser Gly Ile Ala Gln Arg Leu Ala Gly Leu
4885 4890 4895



... ctc gat gtc gta cgg gcc cag 14736
Ser Thr Ala Glu Arg Arg Glu Ala Leu Leu Asp Val Val Arg Ala Gln
4900 4905 4910

... ccg gaa acg atc gcc cct gac 14784
Ile Ala Thr Val Leu Gly His Ala Gly Pro Glu Thr Ile Ala Pro Asp
4915 4920 4925

egg gcc ttc cag gac etc ggc etc gac tec ctg acg gcg ate gaa etc 14832 
Arg Ala Phe Gin Asp Leu Gly Leu Asp Ser Leu Thr Ala He Glu Leu 
4930 4935 4940 

cgt aac ctg etc ggc aag gcc ace ggg ctg egg etc ccg gea acg ace 14880 
Arg Asn Leu Leu Gly Lys Ala Thr Gly Leu Arg Leu Pro Ala Thr Thr 
4945 4950 4955 4960 

S^? Sf^^ ^tg gcc gee eac etc ttg gac 14928 

Val Phe Asp Tyr Pro Thr Val Asp Ala Leu Ala Ala His Leu Leu Asp 
4965 4970 4975 

gaa ctg ttc ggc gcg gag acg ggg ace gcg acg gag acg ccc etc ccg 14976 
Glu Leu Phe Gly Ala Glu Thr Gly Thr Ala Thr Glu Thr Pro Leu Pro 
4980 4985 4990 

gtg ccc ggc ctg ccg tec ctg gcg gac gat ccg gtc gtg ate gtc ggc 15024 
Val Pro Gly Leu Pro Ser Leu Ala Asp Asp Pro Val Val He Val Gly 
4995 5000 5005 

atg age tgc cgc ttc ccc ggc ggc gtc gcc teg ccg gag gac ctg tgg 15072 
Met Ser Cys Arg Phe Pro Gly Gly Val Ala Ser Pro Glu Asp Leu Trp 
5010 5015 5020 

cgc ctg gtg gcg gac ggc gtg gac gcc gtc tcc gcc ttc ccg acc gac 15120
Arg Leu Val Ala Asp Gly Val Asp Ala Val Ser Ala Phe Pro Thr Asp
5025 5030 5035 5040 

egg ggc tgg gag ate gac gac ace tac gac ccc gag egg gag ggc gee 15168 
Arg Gly Trp Glu He Asp Asp Thr Tyr Asp Pro Glu Arg Glu Gly Ala 
5045 5050 5055 

ate gcc ace cgt tec ggt gga ttc etc eac gac gcg gcg gag ttc gac 15216 
He Ala Thr Arg Ser Gly Gly Phe Leu His Asp Ala Ala Glu Phe Asp 
5060 5065 5070 

ccc gag ttc ttc ggg atg age ccg cgc gag gcc ctg ace ace gac gcc 15264 
Pro Glu Phe Phe Gly Met Ser Pro Arg Glu Ala Leu Thr Thr Asp Ala 
5075 5080 5085 

... acg acc tgg gag gcg ctg gag cgc gcc 15312
Gln Gln Arg Leu Leu Leu Glu Thr Thr Trp Glu Ala Leu Glu Arg Ala
5090 5095 5100

... ggc agc cgc acg ggt gtc ttc gcc 15360
Gly Met Asp Pro Ala Thr Leu Arg Gly Ser Arg Thr Gly Val Phe Ala
5105 5110 5115 5120

vj^v; gtg atg tac -eac gac L ac L e g acg ctg c Lc Lee ggg -cgc -gay ttc - 15408 — 
Gly Val Met Tyr His Asp Tyr Ser Thr Leu Leu Ser Gly Arg Glu Phe 
5125 5130 5135 

... agt gtg gcc tcg ggc cgg 15456
Glu Gly Tyr Gln Gly Ser Gly Ser Ala Gly Ser Val Ala Ser Gly Arg
5140 5145 5150 

gtc tcg tac acc ttc ggt ttc gag ggt ccg gcg gtc acg gtg gac acg 15504
Val Ser Tyr Thr Phe Gly Phe Glu Gly Pro Ala Val Thr Val Asp Thr
5155 5160 5165

gcg tgc tcg tcg tcc ctg gtc gcc ctg cac ctg gca gca cag tcg ctg 15552
Ala Cys Ser Ser Ser Leu Val Ala Leu His Leu Ala Ala Gln Ser Leu
5170 5175 5180

cgc tcg ggc gag tgc tcg ctg gcg ctc gcg ggc ggt gtg acg gtg atg 15600
Arg Ser Gly Glu Cys Ser Leu Ala Leu Ala Gly Gly Val Thr Val Met
5185 5190 5195 5200

tcc aca ccg ctg acc ttc gtg gag ttc tcc cgc cag ggc gga ctg tcg 15648
Ser Thr Pro Leu Thr Phe Val Glu Phe Ser Arg Gln Gly Gly Leu Ser
5205 5210 5215

gcg gac ggc cgc tgc aag gcg ttc gcc gat gcg gcc gac ggc gtc ggc 15696
Ala Asp Gly Arg Cys Lys Ala Phe Ala Asp Ala Ala Asp Gly Val Gly
5220 5225 5230

tgg gcc gaa ggc gcc gga atc ctg gtg ctg gag cgt ctg tcg gac gcc 15744
Trp Ala Glu Gly Ala Gly Ile Leu Val Leu Glu Arg Leu Ser Asp Ala
5235 5240 5245

cgc cgc aac ggg cac cgc atc ctc gcg acg gtg cgc ggc agt gcg gtg 15792
Arg Arg Asn Gly His Arg Ile Leu Ala Thr Val Arg Gly Ser Ala Val
5250 5255 5260

aac cag gac ggt gcg tcc aat ggt ctg acg gcg ccg aac ggt ccc gcc 15840
Asn Gln Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ala
5265 5270 5275 5280

cag cag cgg gtg atc cgg cag gcg ctg gcg agt gcg ggg ctg tcg gcg 15888
Gln Gln Arg Val Ile Arg Gln Ala Leu Ala Ser Ala Gly Leu Ser Ala
5285 5290 5295

gcg gat gtg gac gcg gtg gag gcg cac ggt acg ggt acg acg ctg ggc 15936
Ala Asp Val Asp Ala Val Glu Ala His Gly Thr Gly Thr Thr Leu Gly
5300 5305 5310

gat ccg atc gag gcc cag gcg ctg ctc gcg acg tat ggc cag gag cgg 15984
Asp Pro Ile Glu Ala Gln Ala Leu Leu Ala Thr Tyr Gly Gln Glu Arg
5315 5320 5325

ccg gag gac cgg ccg ttg ctg ctc ggc tcc gtg aag tcc aac atc ggt 16032
Pro Glu Asp Arg Pro Leu Leu Leu Gly Ser Val Lys Ser Asn Ile Gly
5330 5335 5340

cac gcg caa gcg gct tcg ggt gtt gcc ggt gtc atc aag atg gtg ctg 16080
His Ala Gln Ala Ala Ser Gly Val Ala Gly Val Ile Lys Met Val Leu
5345 5350 5355 5360

gcg atg cgg cac ggt gtg ctg cct cgg acg ctg cat gtc gac gag ccg 16128
Ala Met Arg His Gly Val Leu Pro Arg Thr Leu His Val Asp Glu Pro
5365 5370 5375

tcg tcg cat gtc gac tgg agc gcc ggt gcc gtc gag ctg ctg acc tcc 16176
Ser Ser His Val Asp Trp Ser Ala Gly Ala Val Glu Leu Leu Thr Ser
5380 5385 5390

gag gcc gag tgg ccg cag ggc gag ggg ccg cgc cgc gcg ggc gtc tcc 16224
Glu Ala Glu Trp Pro Gln Gly Glu Gly Pro Arg Arg Ala Gly Val Ser
5395 5400 5405

tcc ttc ggc atc agt ggg acg aac gcg cat gtg atc ctg gag cag ccc 16272
Ser Phe Gly Ile Ser Gly Thr Asn Ala His Val Ile Leu Glu Gln Pro
5410 5415 5420



gaa ccg gtc gcg gcg gaa acg gaa teg ate acg ccc aae acc aca cea ifii^n 
Glu^Pro Val Ala Ala^Glu Thr Glu Se? Zle rht Pro ftp ^ ffa Pr^ 



5435 5440 



gac gcc gee gag gae gag geg gee gat tec ggg acg ccg ata cca aca le^fis 
Asp Ala Ala Glu Asp Glu Ala Ala Asp Ser Gl? ThI PrI Val Pro Alt 

5450 5455 

ctg ctg tee ggc agg age gca teg geg etg egg gee cag qca aca caa is-tifi 
Leu Leu Ser Oly Arg Ser Ala Ser All Leu Ala Gin A?a III l?g 
5460 5465 = 



5470 



Ctg ctg tec cga etc gae ggc gat ccg ggg ccg egg ate act gae gte 16464 
Leu Leu ser Arg Leu Asp Gly Asp Pro Gly Pro aII He Thr Sp Val 
5475 5480 



5485 



99^= 9cc tte ccg cac cgc gcg gtg 16512 

5I9O 5?95 ^ ^^"^ ^^'^SSoi ^ 

atc ctc gcc gcg aac cga gcg gac ctg ctg cac tcg ctg tcc gcc ctg 16560
Ile Leu Ala Ala Asn Arg Ala Asp Leu Leu His Ser Leu Ser Ala Leu

5510 5515 5520 

ff^ H?^ H?° *==9 9<=<= 9ta gte gca cag gac cga gcc I66O8 

Ala Glu Gly His Thr Glu Ala Pro Ala Val Val Ala Gil Asp ^ III 

5525 5530 



5535 



cgc teg ggc aag ctg gcc tte ctg tte teg ggg eag gga teg caa cac 
Arg Ser Gly Lys Leu Ala Phe Leu Phe Sel Gl? Gin Gly sl? Gin I?S 
5540 5545 5550 

ctg ggc atg gga egg gag ttg tac ggt ege tac ccg gcg tte acc aao istoa 
Leu Gly Met Gly Arg Glu Leu Tyr Gly A?g Tyr PrI I?I SJe III 
5555 5560 5565 

gcc etc gae gcg gtg tgc gee gcc ctg gac gee cac etg gac cgt ccc 167 1;? 

""^^^n ""^^ Ala His Leu Op llj Pro 

5570 5575 55gQ 

ctg cgg gac gtc atc tgg ggc gag gac gcg gaa ctg ctg aac cgg acc 16800
Leu Arg Asp Val Ile Trp Gly Glu Asp Ala Glu Leu Leu Asn Arg Thr
5585 5590 5595 5600

ggg tac gee cag aca ggg etg tte gee ate gag gtg ace eta tte r-an i««^a 
Gly Tyr Ala Gin Thr Gly LeS Phe Ala He Glu Val A^a Eeu t^l Sg 
5605 5610 ^ 



5615 



ctg ctg gag teg tgg ggc gta cgc ccg gac cac ctg etg ggg cac tee 16896 
Leu Leu Glu^Ser Trp Gly Val Arg^go Asp His Leu Leu Gly Hil sir 



5630 



ilf r?5 nft ?? ff^ ""^^ ggc gte etc tee etc ccg 16944 
He Gl y Glu He Ala Ala Ala His Val Ala Gly Val Leu Ser Leu PrI ^ 
5635 ■ , SCaO.- f|f;^q 

gac gcc tgt gcg ctg gtg gcg gcc cga ggt cgg ctg atg cag caa cta 16992
Asp Ala Cys Ala Leu Val Ala Ala Arg Gly Arg Leu Met Gln Gln Leu
5650 5655 5660 



ccg tec ggc ggc gcg atg atg gcg ate egg gcg acc gag gac gag gtc 17040 
^^^'^ Arg Ala Thr Glu Asp Glu Val 

5670 5675 568O 



ctt ccg cat ctg gcg gaa ggc gtc tcg ctc gcg gcg gtc aac ggg ccg 17088
Leu Pro His Leu Ala Glu Gly Val Ser Leu Ala Ala Val Asn Gly Pro
5685 5690 5695

tcg tcg gtc gtg gtc tcc ggc gcc gag gac gag gta ctc gcc ctc gcg 17136
Ser Ser Val Val Val Ser Gly Ala Glu Asp Glu Val Leu Ala Leu Ala
5700 5705 5710

gcg cac ttc gag gaa gag gga cgc aag acc acc cga ctg cgg gtc tcg 17184
Ala His Phe Glu Glu Glu Gly Arg Lys Thr Thr Arg Leu Arg Val Ser
5715 5720 5725

cac gcc ttc cac tcc ccg ctc atg gaa ccg atg ctg gcc gac ttc cgg 17232
His Ala Phe His Ser Pro Leu Met Glu Pro Met Leu Ala Asp Phe Arg
5730 5735 5740

gcc gtc gcc gac ggc atg acc tac gcc gcg ccg cgc atc ccc gtg gtc 17280
Ala Val Ala Asp Gly Met Thr Tyr Ala Ala Pro Arg Ile Pro Val Val
5745 5750 5755 5760

tcc aac gtc acc ggc cgg ccc gcc acc gcg gaa gag ctg tgc tgc gcc 17328
Ser Asn Val Thr Gly Arg Pro Ala Thr Ala Glu Glu Leu Cys Cys Ala
5765 5770 5775

gag tac tgg gtc ggc cac gta cgc gag gcc gta cgg ttc gcc gac ggg 17376
Glu Tyr Trp Val Gly His Val Arg Glu Ala Val Arg Phe Ala Asp Gly
5780 5785 5790

gtc ggc gcg ctc cgc gag cag ggt gtg acg acg ttc ctg gaa ctc ggc 17424
Val Gly Ala Leu Arg Glu Gln Gly Val Thr Thr Phe Leu Glu Leu Gly
5795 5800 5805

ccc gac ggc tct ctc tcc gcg ctc gcc gcc gaa tcc gcc gcc gac gac 17472
Pro Asp Gly Ser Leu Ser Ala Leu Ala Ala Glu Ser Ala Ala Asp Asp
5810 5815 5820

tcc gta ctg gcc ccc gta ctg cgc aag aac cgc ccc gag gca ccg gca 17520
Ser Val Leu Ala Pro Val Leu Arg Lys Asn Arg Pro Glu Ala Pro Ala
5825 5830 5835 5840

ctg ctc acg gcc ctg gca cga ctg cac gcc cag ggc acg ccg gtc gac 17568
Leu Leu Thr Ala Leu Ala Arg Leu His Ala Gln Gly Thr Pro Val Asp
5845 5850 5855

tgg tcc gcc gcc ttc gcc ggt acg ggt gcg cgg tgg gtg gac ctg ccg 17616
Trp Ser Ala Ala Phe Ala Gly Thr Gly Ala Arg Trp Val Asp Leu Pro
5860 5865 5870

acg tac gca ttc cag cac gag cgg ttc tgg ccg tcg ggc ggg gcg gcg 17664
Thr Tyr Ala Phe Gln His Glu Arg Phe Trp Pro Ser Gly Gly Ala Ala
5875 5880 5885

cgc gca ggc gat gtg cgg tcc gcg ggc ctg ggc tcg gcc ggg cac ccg 17712
Arg Ala Gly Asp Val Arg Ser Ala Gly Leu Gly Ser Ala Gly His Pro
5890 5895 5900

ctg ctg ggt gct gcg gtg gaa ctg gcg ggc tcc ggc ggg cgg ttg ctc 17760
Leu Leu Gly Ala Ala Val Glu Leu Ala Gly Ser Gly Gly Arg Leu Leu
5905 5910 5915 5920

acc ggg cgg ctg tcc ctg tcc tcg cac ccg tgg ctg gcg gat cac gtg 17808
Thr Gly Arg Leu Ser Leu Ser Ser His Pro Trp Leu Ala Asp His Val
5925 5930 5935

gtg ctg ggc tcc gta ctg gtg ccc ggc acg gcg ctc atg gaa ctg gtg 17856
Val Leu Gly Ser Val Leu Val Pro Gly Thr Ala Leu Met Glu Leu Val
5940 5945 5950

ctg egg gcg gee gac gag gtg gac tgc gcc gcg gtg gac qaa etc aca 17904 
Leu Arg Ala Ala Asp Glu Val Asp C?s Ala All Val Lp Glu LeS ?hl 
5955 5960 5955 

etc gcc gcg cca ctg gtc ctg cec gcc teg ggc gcc gcg ate cag gta 17952 
^'"^ ^^"^ ^^'^ Ala Ser Gly Ala Ala He GlI Val 
5975 5980 

cag gta tgg gtg ggc gag ecc gat gag gcg ggc cge egg ccej otc tea 18 000 
Gin Val Trp Val Gly Glu Pro Asp Glu All Gly A?g aI| tto vS Itx 
^^^^ 5990 5995 goOO 

Zf^ f?* H?^ ggc cca tgg acg ctg cac gcc gac ggc 18048 

Val His Ala Arg Glu Gly Glu Gly Pro Trp Thr Leu His Ala Asp Glv 
6005 6010 6015 

f?f rtf, f?"" ^""^ f?^ ff*" H?^ *=*=9 Srat a<=c ata tgg 18096 

Ala Leu Ala Pro Ala Ala Glu Thr Val Pro Phe Asp Thr Ala He 1^ 

6020 6025 6030 

cec ccg cag ggt gee gag cae ctg gac gcg gcg ggc tgt tac gag cqq 18144 
Pro Pro Gin Gly Ala Glu His Leu Asp Ala Ala Gly C?s Tyr Gl2 I?f 
6035 6040 6045 

ttc gcg gac gcc gga ttc gcg tac ggc ccg gtg ttc cag ggc eta caa 18192 

6050 '^'^ ^^^eilt ^ ^ 6?60 ^ ^ ^ 

f?^ f?*" f*^ f''° 3*° 9cc gag gtc gca etc ccc 18240 

Ala Ala Trp Lys Leu Gly Glu Asp He Tyr Ala Glu Val Ala Leu Pro 
^°65 6070 6075 6080 

gaa ggc acg gac ggc aac gcc tac ggc ctg cac ccc gca etc ttc aac 182 88 
Glu Gly Thr Asp Gly Asn Ala Tyr Gly Leu His Pro Ala Leu Phi Asp 
6085 6090 6095 

f?f f?f S?*' f?* ??® f^*" ""^^ 39*= 9^9 acg gac gaa gcc 18336 

Ala Ala Leu His Ala Ala Leu Leu Gly Gly Glu Gly Thr Asp Glu Ala 
6100 6105 6110 

f?^ S^? «v*^ ^9g aac ggg gtg acg etc cae gcc ace ggc get 18384 

Ala Val Pro Phe Ser Trp Asn Gly Val Thr Leu His Ala Thr Gly Ala 
6115 6120 6125 

tee egg gtg agg gta cge ate egt ccc ace gaa ggc ggt acg tea ata 18432 
Ser Arg Val Arg Val Arg He Arg Pro Thr Glu Ily Gly Th? lit He 
6130 6135 6140 

gcc etc gtg gac ace gcc ggt gcg ccg gtc gee teg gtg ega tec ctg 18480 
Ala Leu val Asp Thr Ala Gly Ala Pro Val Ala Se? Val aS Ser lS 

6150 6155 6160 



... 18528
Thr Ala Arg Pro Ile Thr Ala Gly Gln Leu Gln Thr Gly Asp ...
6165 6170 6175

tcc ctt ttc cag gtc gac tgg acc acc ctc cac ctc acg gac gag cgc 18576
Ser Leu Phe Gln Val Asp Trp Thr Thr Leu His Leu Thr Asp Glu Arg
6180 6185 6190

gcg aac tcc ctc gcc ctg ctc ggc aag gac acc gag ggc atc ctc gac 18624
Ala Asn Ser Leu Ala Leu Leu Gly Lys Asp Thr Glu Gly Ile Leu Asp
6195 6200 6205

aca ctc tcc ctc cag ccc cac gcg gac ctc gac gac ctc gcg gcg acg 18672
Thr Leu Ser Leu Gln Pro His Ala Asp Leu Asp Asp Leu Ala Ala Thr
6210 6215 6220

ggc gtc cac gac acc gtg ctc gcc ccg ctg ccc acc cgg acc gcc gga 18720
Gly Val His Asp Thr Val Leu Ala Pro Leu Pro Thr Arg Thr Ala Gly
6225 6230 6235 6240

acg gtg gaa tcc gtc cat gcc gcc acg aca ggg gca ctg gcc ctg atc 18768
Thr Val Glu Ser Val His Ala Ala Thr Thr Gly Ala Leu Ala Leu Ile
6245 6250 6255

cgg tcc tgg ctg gcc gac gac cgg ttc gcc gcc tcg cgc ctg gtg ttc 18816
Arg Ser Trp Leu Ala Asp Asp Arg Phe Ala Ala Ser Arg Leu Val Phe
6260 6265 6270

gtg acg cgt ggc gcg gtg tcc ggc acg gat ctc gcg ggt gcg tcg gtg 18864
Val Thr Arg Gly Ala Val Ser Gly Thr Asp Leu Ala Gly Ala Ser Val
6275 6280 6285

tgg ggc ctg gtg cgg tcg gcg ttg ttg gag cac ccg ggc cgc ttc ggt 18912
Trp Gly Leu Val Arg Ser Ala Leu Leu Glu His Pro Gly Arg Phe Gly
6290 6295 6300

ctg gtg gac gtg gac gtg gac caa gac gct gaa gtg ccg ctt gtg cca 18960
Leu Val Asp Val Asp Val Asp Gln Asp Ala Glu Val Pro Leu Val Pro
6305 6310 6315 6320

agg gcg ttg gcg tcg gat gaa ccg cag gtg ttg gtg cgt ggt ggt gag 19008
Arg Ala Leu Ala Ser Asp Glu Pro Gln Val Leu Val Arg Gly Gly Glu
6325 6330 6335

gtg ctg gcg gcc cgg ctg gtc cgg gcg cag tcc tcg gac acg gtg acg 19056
Val Leu Ala Ala Arg Leu Val Arg Ala Gln Ser Ser Asp Thr Val Thr
6340 6345 6350



tgg gat ceg tec ggt aeg gtc ctg ate ace ggc ggg ace ggt ggg ctg 19104 
Trp Asp Pro Ser Gly Thr Val Leu lie Thr Gly Gly Thr Gly Gly Leu 
6355 6360 6365 

ggt cgt agt gtc gee egg cac ttg gtg age gag cac ggg gtg cgc agt 19152 
Gly Arg Ser Val Ala Arg His Leu Val Ser Glu His Gly Val Arg Ser 
6370 6375 6380 

ctg ctg ctg gtc age cgc cgt ggt cee gee gee gag ggt gtc gat gca 19200 
Leu Leu Leu Val Ser Arg Arg Gly Pro Ala Ala Glu Gly Val Asp Ala 
6385 6390 6395 6400 

etc gtt gcc gaa ett gee gag tgc gge gcg cag gtc ace gtc gag get 19248 
Leu Val Ala Glu Leu Ala Glu Cys Gly Ala Gin Val Thr Val Glu Ala 
6405 6410 6415 

tigti gat gt g act aac acq ota acq Qto gcc gat ctg gtg get egg eat 19296 
Cys Asp Val Thr Asp Ala Val Ala Val Ala Asp Leu Val Ala Arg His
6420 6425 6430 



cgg atc agt gct gtg gtg cat acg gcc ggt gtt ctg gat gac ggt gtg 19344
Arg Ile Ser Ala Val Val His Thr Ala Gly Val Leu Asp Asp Gly Val
6435 6440 6445

gtg gag tcg ctg acg ccg gag cgg ctg tcg gcg gtg ctg cgt ccg aag 19392
Val Glu Ser Leu Thr Pro Glu Arg Leu Ser Ala Val Leu Arg Pro Lys
6450 6455 6460



gtg gat gcg gcc tgg aac ctg cac gag gcg acc aaa aat eta aah f-^r, lOAAn 
Val^Asp Ala Ala Trp^Asn Leu His Glu Alljhr ^| l!y Leu Cp Leu 

f?^ H*^? '^'^S 9*^9 9*=^ 99= acc ttc ggc agt gcg 19488 

Asp Ala Phe Val Val Phe Ser Ser Val Ala Gly Thr Phe Gly sir Ala 
648S 6490 6495 

^^'^ 9cg gcg ggt aat get ttc ctg gac gcg ctg qca 19536 
Gly Gin Ala Asn Tyr Ala Ala Gly Asn Ala Phe Leu Asp All Leu All 
6500 6505 6510 

tat cac cgt cgg gcg gtg ggt ttg ccg gcg gtg tcg ctg gcg tgg ggc 19584
Tyr His Arg Arg Ala Val Gly Leu Pro Ala Val Ser Leu Ala Trp Gly
6515 6520 6525 

cct tgg teg cag gac ggt ggt atg acc ggc acc ttg age gac gcc gat 19632 

^^'^ "^'^ Thr Leu sir Asp Ma Isp 

o530 6535 6540 

fr^^ 99*= **=9 ccg ceg ctg acc gtc gag gag 19680 

Val Gin Arg He Ala Arg Gin Gly Met Pro Pro Leu Thr Val Glu Glu 

6550 6555 6560 

ggt ctg gcc etc ttc gac gcc gcg etc ggc age gcc gaa cce atg gca 19728 
Gly Leu Ala Leu Phe Asp Ala Ala Leu Gly Ser Ala Glu Pro Met Ala 
6565 6570 6575 

etc cce gtc cgc ctg gac etc gcg gcc eta egg gca caa ggc gag cce 19776 
Leu Pro Val Arg Leu Asp Leu Ala Ala Leu Arg Ala Gin Gly Glu Pro 
6580 6585 6590 

cag cca ctg ctg cgc ggc ctc atc cgg acc ccg ggt cga cgc acg gcg 19824
Gln Pro Leu Leu Arg Gly Leu Ile Arg Thr Pro Gly Arg Arg Thr Ala
6595 6600 6605 

f?f f?"" ff^ f?^ ^?'' 9ac ace get gee gee ttc gee ggg cgc ctg 19872 

Ala Ala Ala Thr Glu Gly Asp Thr Ala Ala Ala Phe Ala Gly A?g Leu 
6610 6615 6620 

ace ggg ctg teg gcg gca gaa gga cgc gag gtc gta ctg ggc gcc gta 19920 
Thr Gly Leu Ser Ala Ala Glu Gly Arg Glu Val Val Leu Gly Ala Val 
6625 6630 6635 6640 

cgc age cag ate gcg ggg gtc etc gga cac gcc gaa gee aeg gaa ate 19968 
Arg Ser Gin He Ala Gly Val Leu Gly His Ala Glu Ala Thr Qlu He 
6645 6650 6655 

f*"" f^*" ^^'^ ""^^ 9ac etc gga ttc gac tec etc acc gcg 20016 

Asp Gln Asp Arg Ala Phe Leu Asp Leu Gly Phe Asp Ser Leu Thr Ala
6660 6665 6670

... ctg ggc gcc gtc acc gga atc cgc ctg ccg 20064
Val Glu Leu Arg Asn Arg Leu Gly Ala Val Thr Gly Ile Arg Leu Pro
6675 6680 6685

... gca gaa ctc gtc gcc cac 20112
... Pro Ala Glu Leu Val Ala His
6690 6695 6700

ctc cat gcc cgg atc gca ccg gag ccg acc gtc ggc ccg gag gcg ctc 20160
Leu His Ala Arg Ile Ala Pro Glu Pro Thr Val Gly Pro Glu Ala Leu
6705 6710 6715 6720

ctg ggc gaa ctc gaa agg atg gag aag tcc ttc ggc gga ctc gac ctc 20208
Leu Gly Glu Leu Glu Arg Met Glu Lys Ser Phe Gly Gly Leu Asp Leu
6725 6730 6735

acg gag gag atg cac gaa cag ata gcc ggc cgt ctg gaa gtc ctc cgg 20256
Thr Glu Glu Met His Glu Gln Ile Ala Gly Arg Leu Glu Val Leu Arg
6740 6745 6750

gcc aag tgg gac gcc ctg cgg gac acg gca gcg gca gcc ggg cac gac 20304
Ala Lys Trp Asp Ala Leu Arg Asp Thr Ala Ala Ala Ala Gly His Asp
6755 6760 6765

ggt tcc ccg tcc gac gag gac ttc gac ttc gag tcc gcc tcc gac gac 20352
Gly Ser Pro Ser Asp Glu Asp Phe Asp Phe Glu Ser Ala Ser Asp Asp
6770 6775 6780

gag gtc ttc gac ctc ctc gac aac gaa ctc ggc ctg tcc tga 20394
Glu Val Phe Asp Leu Leu Asp Asn Glu Leu Gly Leu Ser 
6785 6790 6795 

<210> 4 
<211> 6798 
<212> PRT 

<213> Streptomyces natalensis 
<400> 4 

Met Ser Asn Glu Glu Lys Leu Arg Glu Tyr Leu Lys Arg Ala Ile Ala
1 5 10 15

Asp Leu His Glu Thr Arg Gln Gln Leu Asp Glu Thr Glu Ala Lys Gln
20 25 30

Arg Glu Pro Leu Ala Ile Val Ser Met Ala Cys Arg Phe Pro Gly Gly
35 40 45

Val Arg Ser Pro Glu Glu Leu Trp Glu Leu Leu Arg Asp Gly Val Asp
50 55 60

Ala Val Ser Ser Phe Pro Arg Asn Arg Gly Trp Asp Leu Asp Ala Leu
65 70 75 80

Tyr His Ser Asp Pro Ala His Gln Gly Thr Ser Tyr Ala Arg Glu Gly
85 90 95

Gly Phe Leu His Asp Ala Gly Glu Phe Asp Pro Gly Phe Phe Gly Ile
100 105 110

Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gln Gln Arg Leu Leu Leu
115 120 125

Glu Thr Ala Trp Glu Ala Val Glu Arg Ala Gly Ile Asp Pro Glu Ser
130 135 140

Leu Ala Gly Ser Arg Thr Gly Val Phe Val Gly Thr Gly His Gly Gly
145 150 155 160

Tyr Asp Ala Glu Gly Arg Arg Arg Ala Asp Glu Val Gly Gly His Leu 
165 170 175 

Leu Thr Gly Asn His Ile Ser Ile Ala Ser Gly Arg Ile Ser Tyr Val
180 185 190

Leu Gly Leu Glu Gly Pro Ala Leu Thr Val Asp Thr Ala Cys Ser Ser
195 200 205

Ser Leu Val Ala Leu His Leu Ala Met His Ala Leu Arg Arg Asp Glu
210 215 220

Cys Ala Met Ala Leu Val Gly Gly Ala Thr Val Met Ser Thr Pro Gln
225 230 235 240

Met Phe Val Glu Phe Ser Arg Gln Arg Gly Leu Ala Pro Asp Gly Arg
245 250 255

Cys Lys Pro Phe Ala Ala Ala Ala Asp Gly Thr Gly Trp Ser Glu Gly
260 265 270

Val Gly Leu Leu Leu Val Glu Arg Leu Ser Asp Ala Val Arg Asn Gly 
275 280 285 

^^■^ ^* Ser Ala Val Asn Gin Asp Gly 

290 295 300 

Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ser Gln Gln Arg Val
305 310 315 320

Ile Arg Gln Ala Leu Thr Gly Ala Gly Leu Ala Ala Ser Asp Ile Asp
325 330 335 

Ala Val Glu Ala His Gly Thr Gly Thr Thr Leu Gly Asp Pro Val Glu 
340 345 350

Ala His Ala Leu Leu Ala Thr Tyr Gly Gln Gln Arg Ala Ala Asp Arg
355 360 365 

Pro Cys Gly Leu Gly Ser Met Lys Ser Asn Ile Gly His Thr Gln Ala
370 375 380

Ala Ala Gly Ile Ala Gly Val Met Lys Met Val Leu Ala Met Arg His

385 390 395 400

Gly His Leu Pro Arg Thr Leu His Leu Asp Glu Pro Thr Gly His Val 
405 410 415 

Asp Trp Ser Glu Gly Asn Ala Arg Leu Leu Ala Glu Pro Glu Pro Trp
420 425 430

Pro Ser Ala Gly Arg Pro Arg Arg Ala Ala Val Ser Ser Phe Gly Ile
435 440 445 

Ser Gly Thr Asn Ala His Val Ile Leu Glu Gln Ala Pro Ala His Glu
450 455 460

Ala Glu Pro Ala Pro Glu Pro Ala Ala Arg Pro Gly Ala Leu Pro Trp 

465 470 475 480

Ile Leu Ser Ala Arg Thr Glu Ala Gly Leu Arg Ala Gln Ala Asp Arg
485 490 495 

Leu Gly Arg His Leu Arg Asp Gly Ala Asp Leu Glu Pro Ala Ala Val
500 505 510 

Ala His Ala Leu Ala Asp Thr Arg Thr Leu Met Glu His Arg Ala Val 
515 520 525 

Val Val Ala Gly Asp Arg Glu Glu Phe Leu Arg Gly Leu Asp Ala Leu 
530 535 540

Ala Ala Gly Arg Thr Ala Asn Gly Leu Val Ser Gly Val Ala Val Lys
545 550 555 560 

Ala Ala Ser Ala Phe Leu Phe Ala Gly Gln Gly Ser Gln Arg Pro Gly
565 570 575 

Met Gly Arg Glu Leu His Ala Ala His Pro Val Phe Ala Thr Ala Phe 
580 585 590 

Asp Ala Val Cys Ala Glu Leu Asp Pro His Leu Asp Arg Pro Leu Arg 
595 600 605 

Asp He Val Phe Ala Glu Glu Gly Ser Ala Glu Ala Ala Leu Leu Asp 
610 615 620 

Gin Thr Ala Tyr Thr Gin Ala Ala Leu Phe Ala Leu Glu Thr Ala Leu 
625 630 635 640 

Phe Arg Leu Val Glu Ser Trp Gly Val Ala Pro Arg Phe Val Ala Gly 
645 650 655 

His Ser He Gly Glu Leu Thr Ala Ala His Val Ser Gly Val Leu Thr 
660 665 670 

Leu His Asp Ala Ala Arg Leu Val Ala Ala Arg Gly Thr Leu Met Gin 
675 680 685 

Ala Leu Pro Ala Gly Gly Ala Met Val Ala Val Gin Ala Thr Glu Asp 
690 695 700 

Glu lie Arg Glu Arg Leu Ala Gly His Glu Asp His Val Ala Leu Ala 
705 710 715 720 

Ala Ala Asn Gly Pro Asp Ser Thr Val lie Ser Gly Asp Glu Gin Ala 
725 730 735 

Val Thr Glu He Ala Ala His Trp Glu Ala Gin Gly Arg Arg Thr Lys 
740 745 750 

Arg Leu Arg Val Ser His Ala Phe His Ser Pro His Met Asp Asp Met 
755 760 765 

Leu Glu Asp Phe Arg Arg Val Ala Arg Gly Leu Thr Phe His Ala Pro 
770 775 780 

Arg He Pro Val Val Ser Thr Val Thr Gly Ala Leu Ala Thr Glu Asp 
785 790 795 800 

Glu Leu Arg Ser Pro Asp Tyr Trp Val Arg Gin Val Arg Glu Thr Val 
805 810 815 

Arg Phe Cys Ala Ala Val Arg Thr Leu Glu Ala Glu Gly Val Thr Thr 
820 825 830 

Phe Val Glu He Gly Thr Gly Gly Val Leu Thr Pro Met Val Gin Asp 

835 840 845 

Cys Leu Thr Thr Leu Glu Glu Pro Val Leu Val Pro Leu Leu Arg Thr 
850 855 860 

Gly Arg Pro Glu Thr Val Ala Leu Thr Glu Gly Val Ala Thr Ala Phe 
865 870 875 880 



Val His Gly Val Pro Val Asp Arg Ser Ala Phe Pro Gly Ala Pro Gly 
885 890 895

Thr Ser Arg Ala Asp Leu Pro Thr Tyr Ala Phe Gln Arg Gln Trp Tyr
900 905 910

Trp Leu Asp Pro Ala Asp His Asp Glu Gly Glu Ala Ala Ala Ala Glu 
915 920 925

Ala Gly Glu Ala Gly Phe Trp Ala Ala Val Glu Arg Glu Asp Leu Gin 
930 935 940

Glu Leu Ser Ala Val Leu Ala He Asp Gly Ser Glu Ala Asp Ser Leu 

945 950 955 960

Gly Ser Phe Leu Pro Thr Leu Ser Ser Trp Arg Arg Gln Arg Arg Thr
965 970 975 

Gln Ala Ala Ala Asp Arg Phe Ser Tyr Arg Thr His Trp Ala Pro Arg
980 985 990

Thr Ala Ser Gly Gly Pro Thr Ala Thr Gly His Trp Leu Val Val Leu 
995 1000 1005

""^ Thr Ala Arg Leu Leu Asp Ala 

1010 1015 1020 

Leu Asn Asp Gin Gly Leu His Thr Asp Val Arg Glu Leu Pro Ala Asp 
1025 1030 1035 1040

His Glu Pro Asp Ala Trp Gly Arg His Pro Val Asp Gly Val Leu Cys
1045 1050 1055 

Leu Leu Ala Leu Asp Glu Arg Pro Thr Arg Ser Cys Pro Pro Tyr Arg
1060 1065 1070 

Arg Gly Leu Ala Ala Thr Thr Asn Ala Ala Ala Arg Pro Glu Gly Ala 
1075 1080 1085 

Gly He Gin Ala Pro Leu Trp Cys Val Thr Arg Gly Ala Val Ala Val 
1090 1095 1100 

Asp Arg His Glu Ala Leu Lys Ser Pro Leu Gin Ala Gin Thr Trp Gly 

1105 1110 1115 1120

Leu Gly Arg Val Ala Ala Leu Glu Ser Pro Gin Ser Trp Gly Gly Leu 
1125 1130 1135 

He Asp Leu Pro Asp Asn Leu Asp Gly Arg Ala Val Ser Ala Leu Leu 
1140 1145 1150 

Ser Thr Leu Ala Gly Glu Glu Asp Gin Val Ala Val Arg Pro Ala Gly 
1155 1160 1165 

Val Phe Ala Arg Arg Leu Glu Arg He Thr Pro Gly Gly Asp Thr Gly 
1170 1175 1180

Asp Arg Trp Ser Thr His Gly Thr Val Leu Val Thr Gly Gly Thr Gly
1185 1190 1195 1200



Ala Leu Gly Ala His Leu Ala His Trp Leu Ala Asp Ala Gly Ala Glu 
1205 1210 1215 

His Leu Val Leu Thr Gly Arg Arg Gly Pro Gin Ala Pro Gly Ala Pro 
1220 1225 1230 

Glu Leu Ala Ala Ala Leu Thr Asp Arg Gly Val Lys Val Thr Leu Ala
1235 1240 1245

Ala Cys Asp Ala Ala Asp Arg Asp Ala Leu Ala Ala Val Leu Ala Asp 
1250 1255 1260 

He Pro Pro His Leu Pro Leu Thr Gly Val Val His Ala Ala Gly Val 
1265 1270 1275 1280

Leu Asp Asp Gly Val Leu Asp Ala Leu Thr Pro Glu Arg Phe Glu Thr 
1285 1290 1295 

Val Leu Arg Pro Lys Ala Arg Ala Ala Gin Asn Leu His Glu Leu Thr 
1300 1305 1310 

Gin Asp Leu Asp Leu Asp His Phe Val Leu Phe Ser Ser He Val Gly 
1315 1320 1325 

Val Leu Gly Asn Ala Gly Gin Ala Asn Tyr Ala Ala Ala Asn Ala Tyr 
1330 1335 1340 

Leu Asp Ala Leu Ala Glu His Arg Leu Ala Gin Gly Leu Pro Ala Thr 
1345 1350 1355 1360

Ser Val Ser Trp Gly Pro Gly Gin Ala Ala Ala Trp His Asp Ser Asp 
1365 1370 1375 

Ala Ala Asp Arg Met Ser Arg Asp Gly Leu Leu Pro Met Ala Ala Ala 
1380 1385 1390 

Pro Arg Arg Arg Pro Ala Pro Ala Leu Ala Gin Gly Met Thr Gin Val 
1395 1400 1405 

Thr Val Ala Asp He Asp Trp Ser Ala Tyr Ala Pro Ala Leu Thr Ala 
1410 1415 1420 

Val Arg Pro Ser Pro Leu He Gly Asp Leu Pro Glu Ala Arg Arg Ala 
1425 1430 1435 1440

Leu Gly Pro Ala Glu Gly Pro Arg Arg Glu Arg Ser Pro Leu Arg Asp 
1445 1450 1455 

Arg He Gly Ala Leu Pro Pro Ala Glu Gin Glu Lys Ala Phe Leu Thr 
1460 1465 1470 

Met Val Arg Glu Glu Ala Ala Arg Val Leu Gly His Pro Ser Pro Asp 
1475 1480 1485 

Thr Val Asp Ala Gin Arg Ala Phe Arg Glu Gin Gly Phe Asp Ser Leu 
1490 1495 1500 

Met Ala Val Asp Leu Arg Asn Arg Leu Ser Ala Ala Thr Gly Leu Arg 
1505 1510 1515 1520

Leu Pro Ala Thr Leu Leu Phe Asp His Pro Thr Pro Leu Ala Ala Ala 
1525 1530 1535 

Ala Cys Leu Arg Ser Glu Val Leu Gly Ala Ala Gly Pro Ala Thr Val
1540 1545 1550 



Val Gin Ala Ser Thr Ala Ala Leu Asp Glu Pro Val Ala He He Gly 
1555 1560 1565 

Met Ala Cys Arg Phe Pro Gly Gly Val His Ser Pro Glu Ala Leu Trp 
1570 1575 1580

Arg Leu Leu Ala Glu Gly Gly Asp Ala Ile Thr Pro Met Pro Ala Asp
1585 1590 1595 1600

Arg Gly Trp Asp Leu Asp Arg Leu Tyr His Pro Asp Pro Asp His Gin 
1605 1610 1615

Gly Thr Ser Tyr Ala Arg Gly Gly Gly Phe Leu Asp Gly Ala Ala Asp 
1620 1625 1630

Phe Asp Ala Asp Phe Phe Gly He Ser Pro Arg Glu Ala Leu Ala Met 
1635 1640 1645

^"""^ ^ Tit Glu Val Leu Glu Gin 

1650 1655 1660

Ala Gly He Asp Pro Glu Ser Leu Arg Gly Ser Ser Thr Gly Val Phe 

1665 1670 1675 1680

Ala Gly Thr Asn Thr Gin Asp Tyr Gly Thr Ala Leu Asp Ala Ala Gin 
1685 1690 1695 

Asp Glu Ala Gly Gly His Arg Leu Thr Gly Asn Ala Met Ser Val Val 
1700 1705 1710 

Ser Gly Arg Val Ser Tyr Thr Phe Gly Phe Glu Gly Pro Ala Leu Thr 
1715 1720 1725 

^^'^ ^« Val Ala Leu His Met Ala Ala 
1730 1735 1740

Gln Ala Leu Arg Gln Gly Glu Cys Ser Leu Ala Val Ala Gly Gly Val
1745 1750 1755 1760

Thr Val Met Ala Thr Pro Ser Ser Phe Val Glu Phe Ala Arg Gln Arg
1765 1770 1775 

Gly Leu Ala Pro Asp Gly Arg Cys Lys Pro Phe Ala Ala Ala Ala Asp 
1780 1785 1790 

Gly Thr Gly Trp Ser Glu Gly Val Gly Leu Leu Leu Val Glu Arg Leu 
1795 1800 1805

^^^,^5 ^•'•^ ^9 ^ Gin Val Leu Ala Val Val Arg Glv 

1810 1815 1820 

Ser Ala Val Asn Gin Asp Gly Ala Ser Asn Gly Leu Ser Ala Pro Ser 

1825 1830 1835 1840

Gly Pro Ser Gin Gin Arg Val He Arg Gin Ala Leu Ala Asn Ala Arg 
1845 1850 1855 

Val Ala Ala Ser Glu Val Asp Ala Val Glu Ala His Gly Thr Gly Thr 
1860 1865 1870

Thr Leu Gly Asp Pro He Glu Ala Gin Ala Leu Leu Ala Thr Tyr Gly 
1875 1880 1885 



^ Ala Val Lys Ser Asn Leu Gly His 

1890 1895 1900 

Thr Gin Ala Ala Ala Gly Val Ala Gly Val Met Lys Met Val Leu Ala 

1905 1910 1915 1920

Met Arg His Gly Met Leu Pro Arg Thr Leu His Val Asp Glu Pro Thr 
1925 1930 1935

Gly His Val Asp Trp Thr Ala Gly Ala Val Glu Leu Leu Thr Glu His
1940 1945 1950 

Thr Asp Trp Pro Glu Thr Gly His Pro Arg Arg Ala Ala Val Ser Ala 
1955 1960 1965 

Phe Gly Ile Ser Gly Thr Asn Ala His Val Val Leu Glu Leu Pro Ala
1970 1975 1980 

Ala Glu Gin Pro Leu Val Glu Gin Pro Ser Ala Ala Glu Pro Asp Ala 
1985 1990 1995 2000

Pro Ala Thr Ala Pro Asp Arg Thr Pro Thr Ala Ser Asp Gly Thr Ala 
2005 2010 2015 

Pro Leu Leu Leu Ser Ala Lys Ser Glu Ser Ala Leu Arg Ala Gin Ala 
2020 2025 2030 

Ala Arg Leu His Ser His Leu Glu Arg Asp Pro Ala Leu Arg Leu Thr 
2035 2040 2045 

Asp Ala Ala Tyr Thr Leu Met Thr His Arg Thr Ala Phe Ala His Arg 
2050 2055 2060 

Ala Ala Val Arg Ala Ala Asp His Glu Ala Ala Leu Arg Ala Leu Thr 
2065 2070 2075 2080

Ala Leu Ala Ala Gly Glu Ala Asp Pro Ala Val Asp Thr Gly Thr Ala 
2085 2090 2095 

His Thr Gly Arg Asp Ala Val Leu Phe Ser Gly Gln Gly Ser Gln Arg
2100 2105 2110 

Ile Gly Met Gly Arg Glu Leu Ser Gly Arg Tyr Pro Val Phe Ala Glu
2115 2120 2125 

Ala Phe Asp Thr Val Cys Ala Ala Leu Asp Glu His Leu Asp Arg Pro 
2130 2135 2140 

Leu Arg Asp Val Val Arg Gly Glu Asp Glu Glu Leu Leu Asn Arg Thr 
2145 2150 2155 2160

Val Tyr Ala Gln Ala Gly Leu Phe Ala Ile Glu Val Ala Leu Phe Arg
2165 2170 2175 

Leu Val Glu Ser Trp Gly Val Arg Pro His Tyr Val Ala Gly His Ser 
2180 2185 2190 

Val Gly Glu He Ala Ala Ala His Val Ala Gly Val Phe Ser Leu Ala 
2195 2200 2205 

Asp Ala Cys Ala Leu Val Ala Ala Arg Gly Arg Leu Met Gin Ala Leu 
2210 2215 2220 

Pro Ala Gly Gly Ala Met Ala Ala Ile Arg Ala Thr Glu Asp Glu Val
2225 2230 2235 2240

Leu Pro His Leu Ala Asp Ser Val Ser He Ala Ala Val Asn Gly Pro 
2245 2250 2255 



Ser Ser Val Val Val Ser Gly Ala Glu His Ala Val Leu Ser He Ala 
2260 2265 2270 

Ala His Phe Glu Gly Ala Gly Arg Lys Thr Thr Arg Leu Arg Val Ser
2275 2280 2285

His Ala Phe His Ser Pro Leu Met Asp Pro Met Leu Ala Asp Phe Arg 
2290 2295 2300 

Ala Val Ala Glu Gly Leu Thr Tyr Gly Glu Pro Glu Leu Ala Val Val 
2305 2310 2315 2320

Ser Asn Val Thr Gly Gin Leu Ala Thr Pro Asp Gin Leu Arg Thr Pro 
2325 2330 2335 

Glu Tyr Trp Val Thr His Val Arg Ala Ala Val Arg Phe Ala Asp Gly 
2340 2345 2350 

lie Arg Ala Leu Gly Ala Glu Gly Val Thr Arg Phe Leu Glu Leu Gly 
2355 2360 2365 

Pro Asp Gly Val Leu Ser Ala Leu Ala Arg Glu Ser Ala Pro Asp Asp 
2370 2375 2380 

Ala Val Cys Thr Pro Val Leu Arg Lys Asp Arg Ser Glu Ala Ala Thr 
2385 2390 2395 2400

Leu Leu Ala Ala Leu Thr His Leu His Val His Gly Thr Glu lie Asp 
2405 2410 2415 

Trp Thr Ala Phe Leu Ala Gly Arg Asp Ala His Ala Val Asp Leu Pro 
2420 2425 2430 

Thr Tyr Ala Phe Gin His Gin Arg Phe Trp Pro Thr Pro Asp His Thr 
2435 2440 2445 

Arg Thr Gly Asp Leu Gly Ala Val Gly Leu Glu Ala Thr Gly His Pro 
2450 2455 2460 

Leu Leu Ser Ala Ala Val Glu Leu Pro Asp Gly Glu Gly Leu Leu Phe 
2465 2470 2475 2480

Thr Thr Arg Leu Ser Leu Gin Thr His Pro Trp Leu Ala Gly His Val 
2485 2490 2495 

Val Met Gly Ser Val Leu Leu Pro Gly Thr Ala Phe Ala Glu Leu Ala 
2500 2505 2510 

Leu Arg Ala Ala Asp Glu Val Gly Cys Asp Arg Val Asp Glu Leu Thr 
2515 2520 2525 

Leu Ala Ala Pro Leu Val Leu Pro Glu His Gly Gly Val Gin Leu Gin 
2530 2535 2540 

Leu Arg Val Gly Pro Ala Trp Ala Ser Gly Arg Arg Thr Leu Thr Ala
2545 2550 2555 2560

Arg Ser Arg Ala Glu Gly Asp Gly Asp Arg Pro Trp Val Gin His Ala 
2565 2570 2575 

Thr Gly Val Leu A la nln niy oln qf^i- tV*t- P-rn r^iy pro r i iy Ty r Aflp 
2580 2585 2590 

Phe His Thr Glu Ser Trp Pro Pro Ala Asp Ala Ala Pro Val Glu Leu 
2595 2600 2605 

Ser Gly Leu Tyr Pro Asp Phe Ala Ala His Gly Phe Asp Tyr Gly Pro
2610 2615 2620

His Phe Gln Gly Leu Arg Thr Ala Trp Arg Arg Gly Asp Glu Val Phe
2625 2630 2635 2640

Ala Glu Val Ala Leu Pro Ala Glu Ala Glu Gly Glu Ala Ser Ala Tyr 
2645 2650 2655 

Gly Leu His Pro Ala Leu Leu Asp Ala Ala Leu His Val Val Ala Phe 
2660 2665 2670 

Asn Gly Val Asp Arg Gly Val Val Pro Phe Ser Trp Glu Ser Val Ala 
2675 2680 2685 

Leu His Ala Thr Gly Ala Ser Ala Val Arg lie Arg Val Val Arg His 
2690 2695 2700 

Ser Gly Asp Thr Val Ser Val Asp Val Ala Asp Thr Thr Gly Glu Pro 
2705 2710 2715 2720

Val Ala Ser lie Gly Thr Leu Val Leu Arg Ala Val Ser Ala Asp Gin 
2725 2730 2735 

Leu Ala Gly Gly Ala Asp Pro Ala Val Arg Asp Ala Leu Phe Arg Val 
2740 2745 2750 

Gin Trp Asn Pro Val Arg Leu Pro Pro Ala Gly Ala Ala Val Thr Val 
2755 2760 2765 

Ala Thr Leu Gly Ser Leu Ala Gly Ala Pro Phe Asp Gly Tyr Pro Asp 
2770 2775 2780 

Leu Ala Ser Leu Ala Arg Ser Gly Arg Val Ala Gly Ala Val Leu Val 
2785 2790 2795 2800

Pro Val Glu Ala Gly Ala Gly Glu Val Val Ala Asp Asp Val Val Gly 
2805 2810 2815 

Ala Thr His Ala Thr Ala Ala Arg Ala Leu Asp Leu Ala Arg Ser Trp 
2820 2825 2830 

Leu Ala Asp Asp Arg Phe Ala Ala Ser Arg Leu Val Phe Val Thr Arg 
2835 2840 2845 

Gly Ala Val Ser Gly Ala Asp Leu Ala Gly Ala Ala Val Trp Gly Leu 
2850 2855 2860 

Val Arg Ser Ala Leu Ser Glu His Pro Gly Arg Phe Gly Leu Val Asp 
2865 2870 2875 2880

Leu Asp Asp Asp Ala Glu Leu Ala Leu Val Pro Arg Val Leu Ala Ser 
2885 2890 2895 

Asp Glu Pro Gin Leu Leu Val Arg Gly Gly Glu Val Leu Ala Ala Arg 
2900 2905 2910 

Leu Ala Arg Ala Gin Ser Ser His Ala Val Thr Trp Asp Pro Ser Gly 
2915 2920 2925 

Thr Val Leu Val Thr Gly Gly Thr Gly Gly Leu Gly Arg Val Met Ala 
2930 2935 2940 

Arg His Leu Val Val Glu His Gly Val Arg Asn Leu Leu Leu Val Ser 
2945 2950 2955 2960

Arg Arg Gly Pro Ala Ala Glu Gly Ala Glu Glu Leu Val Thr Glu Leu 
2965 2970 2975

Arg His Ser Gly Ala Glu Val Ala Val Glu Ala Cys Asp Val Thr Asp
2980 2985 2990 

Ala Ala Ala Val Ala Asp Leu Val Ala Arg His Arg He Ser Ala Val 
2995 3000 3005 

Val His Thr Ala Gly Val Leu Asp Asp Gly Val Val Glu Ser Leu Thr 
3010 3015 3020 

Pro Glu Arg Leu Ser Ala Val Leu Arg Pro Lys Val Asp Ala Ala Trp
3025 3030 3035 3040

Asn Leu His Glu Ala Thr Arg Asp Leu Asp Leu Asp Ala Phe Val Val
3045 3050 3055 

Phe Ser Ser Val Ala Gly Thr He Gly Ser Pro Gly Gin Ala Asn Tyr 
3060 3065 3070 

Ala Ala Gly Asn Ala Phe Leu Asp Ala Leu Ala His His Arg Arg Ala
3075 3080 3085 

Ala Gly Leu Pro Ala Ala Ser Leu Ala Trp Gly Pro Trp Ser Arg Asp
3090 3095 3100

Gly Gly Met Thr Gly Thr Leu Thr Asp Val Asp Ser Ser Ala Ser Pro 

3105 3110 3115 3120

Gly Arg His Ala Arg Thr His Pro Arg Thr Gly Arg Gly Leu Phe Asp 
3125 3130 3135 

Ala Ala Leu Ala Ala Gly Asp Ala His Leu Leu Pro Val Arg Phe Asp 
3140 3145 3150 

Trp Ala Ser Leu Arg Ala Gin Gly Glu Val Pro Pro Leu Leu Arg Gly 
3155 3160 3165 

Leu Ile Arg Thr Arg Ala Arg Arg Ser Ala Val Gly Gly Ser Ala Ala
3170 3175 3180 

Ala Ala Gly Leu Val Gly Arg Leu Ser Gly Arg Gly Thr Val Glu Arg
3185 3190 3195 3200

Arg Glu Val Leu Leu Asp Leu Val Arg Ala Gin He Ala Val Val Leu 
3205 3210 3215 

Gly His Ala Asn Pro Glu Thr He Glu Ser Thr Arg Val Phe Gin Asp 
3220 3225 3230 

Leu Gly Phe Asp Ser Leu Thr Ala Val Glu Leu Arg Asn Arg Leu Asn 
3235 3240 3245 

Asn Ala Thr Gly Leu Arg Leu Ser Ala Thr Ala Val Phe Asp Tyr Pro
3250 3255 3260 

Thr Ala Asp Ala Leu Val Asp Phe Leu Leu Asp Glu Leu Phe Gly Ala 
3265 3270 3275 3280

Gin Glu Glu Ala Glu Leu Pro Ala Pro Val Pro Ser Pro Ala Gly Ala 
3285 3290 3295 

Ala Asp Asp Pro Val Val He Val Gly Met Ser Cys Arg Tyr Pro Gly 
3300 3305 3310 

Gly Val Gly Ser Pro Glu Asp Leu Trp Arg Leu Val Ser Glu Gly Val
3315 3320 3325

Asp Ala Val Ser Asp Phe Pro Thr Asp Arg Gly Trp Asp Val Glu Ser 
3330 3335 3340 

Leu Tyr Ser Pro Asp Pro Glu Ala Leu Gly Thr Ser Tyr Thr Arg Ser
3345 3350 3355 3360

Gly Gly Phe Leu His Glu Ala Ala Glu Phe Asp Pro Asp Phe Phe Gly 
3365 3370 3375 

Met Ser Pro Arg Glu Ala Leu Ala Thr Asp Ala Gin Gin Arg Leu Leu 
3380 3385 3390 

Leu Glu Thr Thr Trp Glu Ala lie Glu Arg Thr Gly lie Asp Pro Ala 
3395 3400 3405 

Ser Leu Arg Gly Ser Arg Thr Gly Val Phe Ala Gly Val Met Tyr Thr 
3410 3415 3420 

Asp Tyr Gly Asp Leu Leu Val Gly Asp Gin Phe Glu Gly Tyr Arg Ser 
3425 3430 3435 3440

Asn Gly Ser Ala Ala Ser lie Ala Ser Gly Arg Val Ser Tyr Thr Phe 
3445 3450 3455 

Gly Phe Glu Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser Ser 
3460 3465 3470 

Leu Val Ala Leu His Trp Ala Ala Gin Ser Leu Arg Ser Gly Glu Cys 
3475 3480 3485 

Ser Leu Ala Val Ala Gly Gly Val Thr Val Met Ser Thr Pro Thr Thr 
3490 3495 3500 

Phe Val Glu Phe Ser Arg Gin Arg Gly Leu Ser Ala Asp Gly Arg Cys 
3505 3510 3515 3520

Lys Ala Phe Ala Asp Ala Ala Asp Gly Val Gly Trp Gly Glu Gly Val 
3525 3530 3535 

Gly Met Leu Val Leu Glu Arg Leu Ser Asp Ala Arg Arg Asn Gly His 
3540 3545 3550 

Arg Val Leu Ala Val Val Arg Gly Ser Ala Val Asn Gin Asp Gly Ala 
3555 3560 3565 

Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ala Gin Gin Arg Val lie 
3570 3575 3580 

Arg Gin Ala Leu Ala Ser Ala Gly Leu Ser Ala Ala Asp Val Asp Ala 
3585 3590 3595 3600

Val Glu Ala His Gly Thr Gly Thr Thr Leu Gly Asp Pro lie Glu Ala 
3605 3610 3615 



Gln Ala Leu Leu Ala Thr Tyr Gly Gln Glu Arg Pro Glu Asp Arg Pro
3620 3625 3630 

Leu Leu Leu Gly Ser Val Lys Ser Asn lie Gly His Ala Gin Ala Ala 
3635 3640 3645 

Ser Gly Val Ala Gly Val He Lys Met Val Leu Ala Met Arg His Gly 
3650 3655 3660

Val Leu Pro Arg Thr Leu His Val Asp Glu Pro Ser Ser His Val Asp
3665 3670 3675 3680

Trp Ser Ala Gly Ala Val Glu Leu Leu Thr Ser Glu Ala Glu Trp Pro 
3685 3690 3695 

Gin Gly Glu Gly Pro Arg Arg Ala Gly Val Ser Ser Phe Gly Val Ser 
3700 3705 3710 

Gly Thr Asn Ala His Val lie Leu Glu Gin Pro Gly Pro Asp Ala Ala 
3715 3720 3725 

Asp Ala Ala Pro Asp Ala Thr Val Thr Asp Pro Gly Ala Leu Ala Trp
3730 3735 3740 

Val Leu Ser Ala Arg Asn Glu Ala Ala Leu Arg Cys Gin Ala Ala Arg 

3745 3750 3755 3760

Leu Leu Ser Leu Val Ala Gly Ser Asp Ala Leu Cys Ala Arg Asp He 
3765 3770 3775 

Gly His Ser Leu Val Thr Gly Arg Ser Ser Phe Ala His Arg Ala Val 
3780 3785 3790 

Val Trp Gly Gin Asp Arg Asp Ala Leu Val Arg Ala Leu Ser Ala Leu 
3795 3800 3805 

Ala Val Gly Glu Ala Asp Ala Gly Leu Ala Glu Gly Ala Ser Gly Ala 
3810 3815 3820 

Gly Arg Thr Ala Phe Leu Phe Ser Gly Gin Gly Ser Gin Arg Leu Gly 

3825 3830 3835 3840

Met Gly Trp Glu Leu Tyr Ala Arg Tyr Pro Val Phe Ala Asp Ala Phe 
3845 3850 3855

Asp Ala Val Cys Ala Ala Leu Asp Glu His Leu Glu Arg Pro Leu Arg
3860 3865 3870 

Asp Val Val Trp Gly Glu Asp Ala Glu Leu Leu Asn Gin Thr Ala Tyr 
3875 3880 3885 

Ala Gin Ala Gly Leu Phe Ala He Glu Val Ala Leu Tyr Arg Leu Ala 
3890 3895 3900 

Glu Ser Trp Gly Met Arg Pro Asp Phe Val Ala Gly His Ser Ile Gly
3905 3910 3915 3920

Glu Val Ala Ala Ala His Val Ser Gly Val Phe Ser Leu Pro Asp Ala 
3925 3930 3935 

Cys Ala Leu Val Ala Ala Arg Gly Arg Leu Met Gin Gin Leu Pro Ser 
3940 3945 3950 

Gly Gly Ala Met Met Ala He Arg Ala Thr Glu Asp Glu Val Leu Pro 
3955 3960 3965



His Leu Ala Glu Gly Val Ser Leu Ala Ala Val Asn Gly Pro Ser Ser 
3970 3975 3980 

Val Val He Ser Gly Ala Glu Asp Ala Val Leu Ala He Ala Ala His 

3985 3990 3995 4000

Phe Ala Gly Glu Gly Arg Lys Thr Thr Arg Leu Arg Val Ser His Ala 
4005 4010 4015

Phe His Ser Pro Leu Met Glu Pro Met Leu Glu Glu Phe Arg Ala Val
4020 4025 4030 

Val Thr Arg Leu Ser Phe Gly Thr Pro Thr lie Pro Val Val Ser Asn 
4035 4040 4045 

Leu Thr Gly Arg Leu Ala Glu Pro Glu Gin Leu Ala His Ala Asp Tyr 
4050 4055 4060 

Trp Val Arg His Val Arg Glu Ala Val Arg Phe Ala Asp Gly He Gin 
4065 4070 4075 4080

Ala Leu Arg Ala Glu Gly Val Thr Arg Phe Leu Glu Leu Gly Pro Asp 
4085 4090 4095 

Gly Val Leu Ser Ala Met Ala Arg Glu Ser Ala Ser Asp Asp Ala Val 
4100 4105 4110 

Leu Ala Pro Val Leu Arg Arg Asp Arg Pro Glu Glu Thr Ala Leu Leu 
4115 4120 4125 

Gly Ala Leu Ala Gin Leu Tyr Val Arg Gly Ala His Val Asp Trp Thr 
4130 4135 4140 

Val Pro Phe Ala Gly Ser Gly Ala Arg Trp Ala Asp Leu Pro Thr Tyr 
4145 4150 4155 4160

Ala Phe Gin His Glu Arg Phe Trp Pro Ser Gly Gly Val Ala Arg Pro 
4165 4170 4175 

Gly Asp Val Arg Ser Ala Gly Leu Gly Ser Ala Gly His Pro Leu Leu 
4180 4185 4190 

Gly Ala Ala Val Glu Leu Ala Gly Ser Gly Gly Leu Leu Phe Thr Gly 
4195 4200 4205 

Arg Leu Ser Val Ser Ser His Pro Trp Leu Ala Asp His Val Val Leu 
4210 4215 4220 

Gly Ser Val Leu Val Pro Gly Thr Ala Leu Val Glu Leu Val Leu Arg 
4225 4230 4235 4240

Ala Ala Asp Glu Ala Gly Cys Asp Leu Leu Glu Glu Leu Thr Leu Ala 
4245 4250 4255 

Ala Pro Leu Val Leu Pro Ala Ser Gly Ala Ala Val Gin Val Gin Val 
4260 4265 4270 

Ala Val Gly Glu Pro Asp Glu Ala Gly Arg Arg Pro Val Ser Val His 
4275 4280 4285 

Ala Arg Glu Gly Glu Gly Pro Trp Thr Leu His Ala Ser Gly Ala Val 
4290 4295 4300 

Thr Ser Gly Ala Glu Val Pro Pro Phe Asp Ala Thr Val Trp Pro Pro 
4305 4310 4315 4320

Lys Gly Ala Glu Pro Val Asp Val Ala Asp Cys Tyr Asp Val Leu Ala 
4325 4330 4335 

Asp Ala Gly Leu Thr Tyr Gly Pro Ala Phe His Gly Leu Gin Ala Ala 
4340 4345 4350 



Trp Lys Leu Gly Gly Asp Val Tyr Ala Glu Ala Lys Leu Pro Glu Ser
4355 4360 4365

Thr Asp Gly Asp Ala Tyr Gly Leu His Pro Ala Leu Phe Asp Ala Ala 
4370 4375 4380 

Leu His Ala Ser Ala Leu Gly Gly Ala Glu Ala Gly Gly Val Pro Phe 
4385 4390 4395 4400

Ser Trp Ala Gly Val Ser Leu His Ala Thr Gly Ala Ser His Leu Arg 
4405 4410 4415 

Val Arg He Arg Glu Ala Gly Gly Ala Leu Ser Val Ala He Ala Asp 
4420 4425 4430 

Thr Ser Gly Ala Pro Val Ala Ser Val Glu Ser Leu Val He Arg Pro 
4435 4440 4445 

Leu Ser Ala Gly Gin Val Gin Ala Ala Asp Arg Asp Ala Leu Phe Lys 
4450 4455 4460 

Ala Asp Trp Val Pro Val Pro Leu Thr Asp Glu Arg Val Glu Pro Gly 
4465 4470 4475 4480

Thr Gly Pro Glu Gly Glu Pro Leu Arg Thr Tyr Ala Asp Leu Asp Ser 
4485 4490 4495 

Leu Glu Gly Ala Ala Val Pro Gly Thr Val Leu Val Ala Pro Pro Ser 
4500 4505 4510 

Gly Ala Ala Gly Thr Val Glu Ser Val His Ala Ala Thr Val Trp Ala 
4515 4520 4525 

Leu Glu Met Val Gin Ala Trp Leu Ala Asp Asp Arg Phe Ala Thr Ser 
4530 4535 4540 

Arg Leu Val Phe Val Thr Arg Gly Ala Ala Phe Gly Ala Asp Leu Ala 
4545 4550 4555 4560

Ala Ala Ala Val Arg Gly Leu Val Arg Ser Ala Gin Ser Glu Asn Pro 
4565 4570 4575 

Gly Arg Phe Gly Leu Val Asp Met Asp Gly Asp Ala Asp Thr Thr Val 
4580 4585 4590 

Pro Ala Gin Ala Leu Ala Thr Asp Glu Pro Glu Leu Leu Val Arg Gly 
4595 4600 4605 

Gly Glu Val Leu Ala Ala Arg Leu Val Arg Ala Gin Ser Ser His Thr 
4610 4615 4620 

Val Thr Trp Asp Pro Ser Gly Thr Val Leu Ile Thr Gly Gly Thr Gly
4625 4630 4635 4640

Gly Leu Gly Arg Ser Val Ala Arg His Leu Val Ser Glu His Gly Val 
4645 4650 4655 



Arg Ser Leu Leu Leu Val Ser Arg Arg Gly Pro Ala Ala Glu Gly Ala
4660 4665 4670 

Gly Glu Leu Val Ala Glu Leu Arg Gly Ser Gly Ala Glu Val Val He 
4675 4680 4685 

Glu Ala Cys Asp Val Thr Asp Ala Val Ala Val Ala Asp Leu Val Ala 
4690 4695 4700 






Arg His Arg lie Ser Ala Val Val His Thr Ala Gly Val Leu Asp Asp 
4705 4710 4715 4720

Gly Val Val Glu Ser Leu Thr Pro Glu Arg Leu Ala Val Val Leu Arg 
4725 4730 4735 

Pro Lys Val Asp Ala Ala Trp Asn Leu His Glu Ala Thr Arg Gly Leu 
4740 4745 4750 

Asp Leu Asp Ala Phe Val Val Phe Ser Ser Val Ala Gly Thr Phe Gly 
4755 4760 4765 

Ser Ala Gly Gin Ala Asn Tyr Ala Ala Gly Asn Ala Phe Leu Asp Ala 
4770 4775 4780 

Leu Ala Tyr His Arg Arg Ala Val Gly Leu Pro Ala Val Ser Leu Ala 
4785 4790 4795 4800

Trp Gly Pro Trp Ser Gin Asp Gly Gly Met Thr Gly Thr Leu Ser Asp 
4805 4810 4815

Ala Asp Val Gin Arg lie Ala Arg Gin Gly Met Pro Pro Leu Thr Val 
4820 4825 4830 

Glu Glu Gly Leu Ala Leu Phe Asp Ala Ala Leu Gly Ser Ala Glu Pro 
4835 4840 4845 

Met Ala Leu Pro Val Arg Leu Asp Leu Ala Ala Leu Arg Ala Gin Gly 
4850 4855 4860 

Glu Pro Gin Pro Leu Leu Arg Gly Leu lie Arg Thr Arg Thr Arg Arg 
4865 4870 4875 4880

Ser Gly Ala Ala Ala Ala Ser Gly lie Ala Gin Arg Leu Ala Gly Leu 
4885 4890 4895 

Ser Thr Ala Glu Arg Arg Glu Ala Leu Leu Asp Val Val Arg Ala Gin 
4900 4905 4910 

lie Ala Thr Val Leu Gly His Ala Gly Pro Glu Thr lie Ala Pro Asp 
4915 4920 4925 

Arg Ala Phe Gin Asp Leu Gly Leu Asp Ser Leu Thr Ala lie Glu Leu 
4930 4935 4940 

Arg Asn Leu Leu Gly Lys Ala Thr Gly Leu Arg Leu Pro Ala Thr Thr 
4945 4950 4955 4960

Val Phe Asp Tyr Pro Thr Val Asp Ala Leu Ala Ala His Leu Leu Asp 
4965 4970 4975 

Glu Leu Phe Gly Ala Glu Thr Gly Thr Ala Thr Glu Thr Pro Leu Pro 
4980 4985 4990 

Val Pro Gly Leu Pro Ser Leu Ala Asp Asp Pro Val Val lie Val Gly 
4995 5000 5005

Met Ser Cys Arg Phe Pro Gly Gly Val Ala Ser Pro Glu Asp Leu Trp 
5010 5015 5020 

Arg Leu Val Ala Asp Gly Val Asp Ala Val Ser Ala Phe Pro Thr Asp 
5025 5030 5035 5040



Arg Gly Trp Glu lie Asp Asp Thr Tyr Asp Pro Glu Arg Glu Gly Ala 
5045 5050 5055 






lie Ala Thr Arg Ser Gly Gly Phe Leu His Asp Ala Ala Glu Phe Asp 
5060 5065 5070 

Pro Glu Phe Phe Gly Met Ser Pro Arg Glu Ala Leu Thr Thr Asp Ala 
5075 5080 5085 

Gln Gln Arg Leu Leu Leu Glu Thr Thr Trp Glu Ala Leu Glu Arg Ala
5090 5095 5100 

Gly Met Asp Pro Ala Thr Leu Arg Gly Ser Arg Thr Gly Val Phe Ala 
5105 5110 5115 5120

Gly Val Met Tyr His Asp Tyr Ser Thr Leu Leu Ser Gly Arg Glu Phe 
5125 5130 5135 

Glu Gly Tyr Gin Gly Ser Gly Ser Ala Gly Ser Val Ala Ser Gly Arg 
5140 5145 5150 

Val Ser Tyr Thr Phe Gly Phe Glu Gly Pro Ala Val Thr Val Asp Thr 
5155 5160 5165 

Ala Cys Ser Ser Ser Leu Val Ala Leu His Leu Ala Ala Gin Ser Leu 
5170 5175 5180 

Arg Ser Gly Glu Cys Ser Leu Ala Leu Ala Gly Gly Val Thr Val Met 
5185 5190 5195 5200

Ser Thr Pro Leu Thr Phe Val Glu Phe Ser Arg Gin Gly Gly Leu Ser 
5205 5210 5215 

Ala Asp Gly Arg Cys Lys Ala Phe Ala Asp Ala Ala Asp Gly Val Gly 
5220 5225 5230 

Trp Ala Glu Gly Ala Gly He Leu Val Leu Glu Arg Leu Ser Asp Ala 
5235 5240 5245 

Arg Arg Asn Gly His Arg He Leu Ala Thr Val Arg Gly Ser Ala Val 
5250 5255 5260 

Asn Gin Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ala 
5265 5270 5275 5280

Gin Gin Arg Val He Arg Gin Ala Leu Ala Ser Ala Gly Leu Ser Ala 
5285 5290 5295 

Ala Asp Val Asp Ala Val Glu Ala His Gly Thr Gly Thr Thr Leu Gly 
5300 5305 5310 

Asp Pro He Glu Ala Gin Ala Leu Leu Ala Thr Tyr Gly Gin Glu Arg 
5315 5320 5325 

Pro Glu Asp Arg Pro Leu Leu Leu Gly Ser Val Lys Ser Asn He Gly 
5330 5335 5340 

His Ala Gln Ala Ala Ser Gly Val Ala Gly Val Ile Lys Met Val Leu
5345 5350 5355 5360

Ala Met Arg His Gly Val Leu Pro Arg Thr Leu His Val Asp Glu Pro 
5365 5370 5375 

Ser Ser His Val Asp Trp Ser Ala Gly Ala Val Glu Leu Leu Thr Ser 
5380 5385 5390 

Glu Ala Glu Trp Pro Gln Gly Glu Gly Pro Arg Arg Ala Gly Val Ser
5395 5400 5405

Ser Phe Gly lie Ser Gly Thr Asn Ala His Val lie Leu Glu Gin Pro 
5410 5415 5420 

Glu Pro Val Ala Ala Glu Thr Glu Ser lie Thr Pro Asp Thr Ala Pro 
5425 5430 5435 5440

Asp Ala Ala Glu Asp Glu Ala Ala Asp Ser Gly Thr Pro Val Pro Ala 
5445 5450 5455 

Leu Leu Ser Gly Arg Ser Ala Ser Ala Leu Arg Ala Gln Ala Ala Arg
5460 5465 5470 

Leu Leu Ser Arg Leu Asp Gly Asp Pro Gly Pro Arg lie Thr Asp Val 
5475 5480 5485 

Ala Tyr Ser Leu Ala Thr Gly Arg Ser Ala Phe Pro His Arg Ala Val 
5490 5495 5500 

lie Leu Ala Ala Asn Arg Ala Asp Leu Leu His Ser Leu Ser Ala Leu 
5505 5510 5515 5520

Ala Glu Gly His Thr Glu Ala Pro Ala Val Val Ala Gin Asp Arg Ala 
5525 5530 5535 

Arg Ser Gly Lys Leu Ala Phe Leu Phe Ser Gly Gin Gly Ser Gin Arg 
5540 5545 5550 

Leu Gly Met Gly Arg Glu Leu Tyr Gly Arg Tyr Pro Ala Phe Ala Glu 
5555 5560 5565 

Ala Leu Asp Ala Val Cys Ala Ala Leu Asp Ala His Leu Asp Arg Pro 
5570 5575 5580 

Leu Arg Asp Val lie Trp Gly Glu Asp Ala Glu Leu Leu Asn Arg Thr 
5585 5590 5595 5600

Gly Tyr Ala Gin Thr Gly Leu Phe Ala He Glu Val Ala Leu Phe Arg 
5605 5610 5615 

Leu Leu Glu Ser Trp Gly Val Arg Pro Asp His Leu Leu Gly His Ser 
5620 5625 5630 

He Gly Glu lie Ala Ala Ala His Val Ala Gly Val Leu Ser Leu Pro 
5635 5640 5645 

Asp Ala Cys Ala Leu Val Ala Ala Arg Gly Arg Leu Met Gin Gin Leu 
5650 5655 5660 

Pro Ser Gly Gly Ala Met Met Ala He Arg Ala Thr Glu Asp Glu Val 
5665 5670 5675 5680

Leu Pro His Leu Ala Glu Gly Val Ser Leu Ala Ala Val Asn Gly Pro 
5685 5690 5695 

Ser Ser Val Val Val Ser Gly Ala Glu Asp Glu Val Leu Ala Leu Ala 
5700 5705 5710 

Ala His Phe Glu Glu Glu Gly Arg Lys Thr Thr Arg Leu Arg Val Ser 
5715 5720 5725 

His Ala Phe His Ser Pro Leu Met Glu Pro Met Leu Ala Asp Phe Arg 
5730 5735 5740 







Ala Val Ala Asp Gly Met Thr Tyr Ala Ala Pro Arg He Pro Val Val 
5745 5750 5755 5760

Ser Asn Val Thr Gly Arg Pro Ala Thr Ala Glu Glu Leu Cys Cys Ala 
5765 5770 5775 

Glu Tyr Trp Val Gly His Val Arg Glu Ala Val Arg Phe Ala Asp Gly 
5780 5785 5790 

Val Gly Ala Leu Arg Glu Gin Gly Val Thr Thr Phe Leu Glu Leu Gly 
5795 5800 5805 

Pro Asp Gly Ser Leu Ser Ala Leu Ala Ala Glu Ser Ala Ala Asp Asp 
5810 5815 5820 

Ser Val Leu Ala Pro Val Leu Arg Lys Asn Arg Pro Glu Ala Pro Ala 
5825 5830 5835 5840

Leu Leu Thr Ala Leu Ala Arg Leu His Ala Gin Gly Thr Pro Val Asp 
5845 5850 5855 

Trp Ser Ala Ala Phe Ala Gly Thr Gly Ala Arg Trp Val Asp Leu Pro 
5860 5865 5870 

Thr Tyr Ala Phe Gin His Glu Arg Phe Trp Pro Ser Gly Gly Ala Ala 
5875 5880 5885 

Arg Ala Gly Asp Val Arg Ser Ala Gly Leu Gly Ser Ala Gly His Pro 
5890 5895 5900 

Leu Leu Gly Ala Ala Val Glu Leu Ala Gly Ser Gly Gly Arg Leu Leu 
5905 5910 5915 5920

Thr Gly Arg Leu Ser Leu Ser Ser His Pro Trp Leu Ala Asp His Val 
5925 5930 5935 

Val Leu Gly Ser Val Leu Val Pro Gly Thr Ala Leu Met Glu Leu Val 
5940 5945 5950 

Leu Arg Ala Ala Asp Glu Val Asp Cys Ala Ala Val Asp Glu Leu Thr 
5955 5960 5965 

Leu Ala Ala Pro Leu Val Leu Pro Ala Ser Gly Ala Ala He Gin Val 
5970 5975 5980 

Gin Val Trp Val Gly Glu Pro Asp Glu Ala Gly Arg Arg Pro Val Ser 
5985 5990 5995 6000

Val His Ala Arg Glu Gly Glu Gly Pro Trp Thr Leu His Ala Asp Gly 
6005 6010 6015 

Ala Leu Ala Pro Ala Ala Glu Thr Val Pro Phe Asp Thr Ala He Trp 
6020 6025 6030 

Pro Pro Gin Gly Ala Glu His Leu Asp Ala Ala Gly Cys Tyr Glu Arg 
6035 6040 6045 



Phe Ala Asp Ala Gly Phe Ala Tyr Gly Pro Val Phe Gin Gly Leu Arg 

6050 6055 6060 

Ala Ala Trp Lys Leu Gly Glu Asp He Tyr Ala Glu Val Ala Leu Pro 

6065 6070 6075 6080

Glu Gly Thr Asp Gly Asn Ala Tyr Gly Leu His Pro Ala Leu Phe Asp
6085 6090 6095

Ala Ala Leu His Ala Ala Leu Leu Gly Gly Glu Gly Thr Asp Glu Ala 
6100 6105 6110 

Ala Val Pro Phe Ser Trp Asn Gly Val Thr Leu His Ala Thr Gly Ala 
6115 6120 6125 

Ser Arg Val Arg Val Arg lie Arg Pro Thr Glu Gly Gly Thr Ser lie 
6130 6135 6140 

Ala Leu Val Asp Thr Ala Gly Ala Pro Val Ala Ser Val Arg Ser Leu 
6145 6150 6155 6160

Thr Ala Arg Pro lie Thr Ala Gly Gin Leu Gin Thr Gly Asp Arg Asp 
6165 6170 6175 

Ser Leu Phe Gin Val Asp Trp Thr Thr Leu His Leu Thr Asp Glu Arg 
6180 6185 6190 

Ala Asn Ser Leu Ala Leu Leu Gly Lys Asp Thr Glu Gly lie Leu Asp 
6195 6200 6205 

Thr Leu Ser Leu Gin Pro His Ala Asp Leu Asp Asp Leu Ala Ala Thr 
6210 6215 6220 

Gly Val His Asp Thr Val Leu Ala Pro Leu Pro Thr Arg Thr Ala Gly
6225 6230 6235 6240

Thr Val Glu Ser Val His Ala Ala Thr Thr Gly Ala Leu Ala Leu He 
6245 6250 6255 

Arg Ser Trp Leu Ala Asp Asp Arg Phe Ala Ala Ser Arg Leu Val Phe 
6260 6265 6270 

Val Thr Arg Gly Ala Val Ser Gly Thr Asp Leu Ala Gly Ala Ser Val 
6275 6280 6285 

Trp Gly Leu Val Arg Ser Ala Leu Leu Glu His Pro Gly Arg Phe Gly 
6290 6295 6300 

Leu Val Asp Val Asp Val Asp Gin Asp Ala Glu Val Pro Leu Val Pro 
6305 6310 6315 6320

Arg Ala Leu Ala Ser Asp Glu Pro Gin Val Leu Val Arg Gly Gly Glu 
6325 6330 6335 

Val Leu Ala Ala Arg Leu Val Arg Ala Gin Ser Ser Asp Thr Val Thr 
6340 6345 6350 

Trp Asp Pro Ser Gly Thr Val Leu He Thr Gly Gly Thr Gly Gly Leu 
6355 6360 6365 

Gly Arg Ser Val Ala Arg His Leu Val Ser Glu His Gly Val Arg Ser 
6370 6375 6380 

Leu Leu Leu Val Ser Arg Arg Gly Pro Ala Ala Glu Gly Val Asp Ala
6385 6390 6395 6400

Leu Val Ala Glu Leu Ala Glu Cys Gly Ala Gin Val Thr Val Glu Ala 
6405 6410 6415 

Cys Asp Val Thr Asp Ala Val Ala Val Ala Asp Leu Val Ala Arg His 
6420 6425 6430 

Arg He Ser Ala Val Val His Thr Ala Gly Val Leu Asp Asp Gly Val 




6435 



6440 



6445 



Val Glu Ser Leu Thr Pro Glu Arg Leu Ser Ala Val Leu Arg Pro Lys 
6450 6455 6460 

Val Asp Ala Ala Trp Asn Leu His Glu Ala Thr Arg Gly Leu Asp Leu 
6465 6470 6475 6480

Asp Ala Phe Val Val Phe Ser Ser Val Ala Gly Thr Phe Gly Ser Ala
6485 6490 6495

Gly Gln Ala Asn Tyr Ala Ala Gly Asn Ala Phe Leu Asp Ala Leu Ala
6500 6505 6510

Tyr His Arg Arg Ala Val Gly Leu Pro Ala Val Ser Leu Ala Trp Gly
6515 6520 6525

Pro Trp Ser Gln Asp Gly Gly Met Thr Gly Thr Leu Ser Asp Ala Asp
6530 6535 6540

Val Gln Arg Ile Ala Arg Gln Gly Met Pro Pro Leu Thr Val Glu Glu
6545 6550 6555 6560

Gly Leu Ala Leu Phe Asp Ala Ala Leu Gly Ser Ala Glu Pro Met Ala
6565 6570 6575

Leu Pro Val Arg Leu Asp Leu Ala Ala Leu Arg Ala Gln Gly Glu Pro
6580 6585 6590

Gln Pro Leu Leu Arg Gly Leu Ile Arg Thr Pro Gly Arg Arg Thr Ala
6595 6600 6605

Ala Ala Ala Thr Glu Gly Asp Thr Ala Ala Ala Phe Ala Gly Arg Leu
6610 6615 6620

Thr Gly Leu Ser Ala Ala Glu Gly Arg Glu Val Val Leu Gly Ala Val
6625 6630 6635 6640

Arg Ser Gln Ile Ala Gly Val Leu Gly His Ala Glu Ala Thr Glu Ile
6645 6650 6655

Asp Gln Asp Arg Ala Phe Leu Asp Leu Gly Phe Asp Ser Leu Thr Ala
6660 6665 6670

Val Glu Leu Arg Asn Arg Leu Gly Ala Val Thr Gly Ile Arg Leu Pro
6675 6680 6685

Ala Thr Leu Leu Phe Asp Tyr Pro Thr Pro Ala Glu Leu Val Ala His
6690 6695 6700

Leu His Ala Arg Ile Ala Pro Glu Pro Thr Val Gly Pro Glu Ala Leu
6705 6710 6715 6720

Leu Gly Glu Leu Glu Arg Met Glu Lys Ser Phe Gly Gly Leu Asp Leu
6725 6730 6735

Thr Glu Glu Met His Glu Gln Ile Ala Gly Arg Leu Glu Val Leu Arg
6740 6745 6750 

Ala Lys Trp Asp Ala Leu Arg Asp Thr Ala Ala Ala Ala Gly His Asp 
6755 6760 6765 

Gly Ser Pro Ser Asp Glu Asp Phe Asp Phe Glu Ser Ala Ser Asp Asp
6770 6775 6780

Glu Val Phe Asp Leu Leu Asp Asn Glu Leu Gly Leu Ser
6785 6790 6795



<210> 5 
<211> 1650 
<212> DNA 

<213> Streptomyces natalensis

<220> 

<221> CDS 

<222> (1)..(1650)

<223> ORF1

<400> 5 

atg ttc gag aac cag cat ctg tca cgg cgc cgt ctg ctc gga ctg gcc 48
Met Phe Glu Asn Gln His Leu Ser Arg Arg Arg Leu Leu Gly Leu Ala
1 5 10 15

gcc ctc ggc ggc gcc gca gcc gcc gga atg acc acg atc acc tcc gcc 96
Ala Leu Gly Gly Ala Ala Ala Ala Gly Met Thr Thr Ile Thr Ser Ala
20 25 30

cct cat gcc gcg gcc gcc gac cgg cgc agt ccg cag gcc cgc agc ggc 144
Pro His Ala Ala Ala Ala Asp Arg Arg Ser Pro Gln Ala Arg Ser Gly
35 40 45

tcg ttc gta ccg gcc gtg gtg atc ggt acg gga tac ggc gcc gcg gtc 192
Ser Phe Val Pro Ala Val Val Ile Gly Thr Gly Tyr Gly Ala Ala Val
50 55 60

tcc gcg ctg cgg ctc ggc gag gcc gga att ccc acg ctc atg ctc gaa 240
Ser Ala Leu Arg Leu Gly Glu Ala Gly Ile Pro Thr Leu Met Leu Glu
65 70 75 80

atg ggc cag ctg tgg aac aag ccc gcc gac gac ggc aac gtc ttc tgc 288
Met Gly Gln Leu Trp Asn Lys Pro Ala Asp Asp Gly Asn Val Phe Cys
85 90 95

gga atg ctc tcg ccc gac cgc cgc tcc agc tgg ttc aag tcc cgc acc 336
Gly Met Leu Ser Pro Asp Arg Arg Ser Ser Trp Phe Lys Ser Arg Thr
100 105 110

gag gcc ccg ctc ggc tcg ttc ctg tgg ctg gat gtg atc aac cgc gac 384
Glu Ala Pro Leu Gly Ser Phe Leu Trp Leu Asp Val Ile Asn Arg Asp
115 120 125

atc gac ccg tac gcg gga gtg ctg gac aag gtg cac ttc gac cag atg 432
Ile Asp Pro Tyr Ala Gly Val Leu Asp Lys Val His Phe Asp Gln Met
130 135 140

tcg gtg tac gtg ggg cgg ggt gtc ggc ggc ggc tcg ctg gtc aac ggc 480
Ser Val Tyr Val Gly Arg Gly Val Gly Gly Gly Ser Leu Val Asn Gly
145 150 155 160

ggg atg gcc gtc gta ccg aag cgc tcg tac ttc gag gag gtc ctc ccg 528
Gly Met Ala Val Val Pro Lys Arg Ser Tyr Phe Glu Glu Val Leu Pro
165 170 175

cgg gtg gac gcc gcc gag atg tac gac cgg tac ttc ccg cgc gcc aac 576
Arg Val Asp Ala Ala Glu Met Tyr Asp Arg Tyr Phe Pro Arg Ala Asn
180 185 190

tcc atg ctc aag gtg aac cac atc gac aag ggg tgg ttc gag gag acg 624
Ser Met Leu Lys Val Asn His Ile Asp Lys Gly Trp Phe Glu Glu Thr
195 200 205

gag tgg tac aag ttc gcg cgg gtc tcg cgc gag cag gcg ggc aag gcg 672
Glu Trp Tyr Lys Phe Ala Arg Val Ser Arg Glu Gln Ala Gly Lys Ala
210 215 220

ggc ctg ggc acc acc ttc gtc ccc aac gtc tac gac ttc gac tac atg 720
Gly Leu Gly Thr Thr Phe Val Pro Asn Val Tyr Asp Phe Asp Tyr Met
225 230 235 240

cgg cgc gag gcg aac ggt gag tcg ccc aag tcc gcg ctg gcg acc gag 768
Arg Arg Glu Ala Asn Gly Glu Ser Pro Lys Ser Ala Leu Ala Thr Glu
245 250 255

gtc atc tac ggc aac aac cac ggc aaa cag agc ctg gac aag acc tac 816
Val Ile Tyr Gly Asn Asn His Gly Lys Gln Ser Leu Asp Lys Thr Tyr
260 265 270

ctg gcc gcc gcg ctc ggc acc ggc aag gtc acc atc gag acc ctg cac 864
Leu Ala Ala Ala Leu Gly Thr Gly Lys Val Thr Ile Glu Thr Leu His
275 280 285

cag gtc agg gcg atc cac cag cag ccg gac ggc agc tac gtg ctg tcc 912
Gln Val Arg Ala Ile His Gln Gln Pro Asp Gly Ser Tyr Val Leu Ser
290 295 300

gtg gac cag atc gac acg gcc ggc cag acc gtc gcc cac aag gag atc 960
Val Asp Gln Ile Asp Thr Ala Gly Gln Thr Val Ala His Lys Glu Ile
305 310 315 320

tcc tgc cgt cac ctg ttc ctc ggc gcc ggc agc ctc ggc tcc acc gaa 1008
Ser Cys Arg His Leu Phe Leu Gly Ala Gly Ser Leu Gly Ser Thr Glu
325 330 335

ctg ctg gtg cgc gcc cgg gac acc ggc gcg ctg ccc gac ctc aac gcc 1056
Leu Leu Val Arg Ala Arg Asp Thr Gly Ala Leu Pro Asp Leu Asn Ala
340 345 350

gag gtc ggc gcg ggc tgg ggc ccc aac ggc aac atc atg acc ggc cgg 1104
Glu Val Gly Ala Gly Trp Gly Pro Asn Gly Asn Ile Met Thr Gly Arg
355 360 365

gcc aac cac gtc tgg aac ccc acc ggg gcc cac cag tcc tcg atc ccc 1152
Ala Asn His Val Trp Asn Pro Thr Gly Ala His Gln Ser Ser Ile Pro
370 375 380

gct ctg ggc atc gac gac tgg aac aac ccc acc gcc ccg gtc ttc gcc 1200
Ala Leu Gly Ile Asp Asp Trp Asn Asn Pro Thr Ala Pro Val Phe Ala
385 390 395 400

gaa atc gcc ccg atg ccc gcc ggg ttg gag acc tgg gtc agc ctc tat 1248
Glu Ile Ala Pro Met Pro Ala Gly Leu Glu Thr Trp Val Ser Leu Tyr
405 410 415

ctg gcg atc acc aag aac ccc gag cgc ggc acc ttc gtc tac gac aag 1296
Leu Ala Ile Thr Lys Asn Pro Glu Arg Gly Thr Phe Val Tyr Asp Lys
420 425 430

gcc acc gac cgg gcc gcg ctg cgc tgg acg cgg gac cag aac acg ccc 1344
Ala Thr Asp Arg Ala Ala Leu Arg Trp Thr Arg Asp Gln Asn Thr Pro
435 440 445

gcg gtc aac gcc gcc agg tcg ctc ttc gac cgc atc aac aag gcc aac 1392
Ala Val Asn Ala Ala Arg Ser Leu Phe Asp Arg Ile Asn Lys Ala Asn
450 455 460

ggc acg atg tac cgc tac gac ctg ttc ggg ccg cag ctg aag aac ttc 1440
Gly Thr Met Tyr Arg Tyr Asp Leu Phe Gly Pro Gln Leu Lys Asn Phe
465 470 475 480

tcc gac gac ttc tgc tac cac ccg ctc ggc ggc tgc gtc ctg ggc aag 1488
Ser Asp Asp Phe Cys Tyr His Pro Leu Gly Gly Cys Val Leu Gly Lys
485 490 495

gcc acc gac ggg tac ggc cgg gtc gcc ggc tac cac aac ctc tac gtc 1536
Ala Thr Asp Gly Tyr Gly Arg Val Ala Gly Tyr His Asn Leu Tyr Val
500 505 510

acg gac ggc gcg ctc atc ccg ggg tcc atc ggg gtc aac ccc ttc gtg 1584
Thr Asp Gly Ala Leu Ile Pro Gly Ser Ile Gly Val Asn Pro Phe Val
515 520 525

acc atc acg gcg ctg gcc gag cgg aac atc gag cgg atc atc gcg gag 1632
Thr Ile Thr Ala Leu Ala Glu Arg Asn Ile Glu Arg Ile Ile Ala Glu
530 535 540

gac gtc aag gcc gcc tag 1650
Asp Val Lys Ala Ala
545 550



<210> 6 
<211> 550 
<212> PRT 

<213> Streptomyces natalensis 
<400> 6 

Met Phe Glu Asn Gln His Leu Ser Arg Arg Arg Leu Leu Gly Leu Ala
1 5 10 15

Ala Leu Gly Gly Ala Ala Ala Ala Gly Met Thr Thr Ile Thr Ser Ala
20 25 30

Pro His Ala Ala Ala Ala Asp Arg Arg Ser Pro Gln Ala Arg Ser Gly
35 40 45

Ser Phe Val Pro Ala Val Val Ile Gly Thr Gly Tyr Gly Ala Ala Val
50 55 60

Ser Ala Leu Arg Leu Gly Glu Ala Gly Ile Pro Thr Leu Met Leu Glu
65 70 75 80

Met Gly Gln Leu Trp Asn Lys Pro Ala Asp Asp Gly Asn Val Phe Cys
85 90 95

Gly Met Leu Ser Pro Asp Arg Arg Ser Ser Trp Phe Lys Ser Arg Thr
100 105 110

Glu Ala Pro Leu Gly Ser Phe Leu Trp Leu Asp Val Ile Asn Arg Asp
115 120 125

Ile Asp Pro Tyr Ala Gly Val Leu Asp Lys Val His Phe Asp Gln Met
130 135 140

Ser Val Tyr Val Gly Arg Gly Val Gly Gly Gly Ser Leu Val Asn Gly
145 150 155 160

Gly Met Ala Val Val Pro Lys Arg Ser Tyr Phe Glu Glu Val Leu Pro
165 170 175

Arg Val Asp Ala Ala Glu Met Tyr Asp Arg Tyr Phe Pro Arg Ala Asn
180 185 190 

Ser Met Leu Lys Val Asn His Ile Asp Lys Gly Trp Phe Glu Glu Thr
195 200 205

Glu Trp Tyr Lys Phe Ala Arg Val Ser Arg Glu Gln Ala Gly Lys Ala
210 215 220

Gly Leu Gly Thr Thr Phe Val Pro Asn Val Tyr Asp Phe Asp Tyr Met
225 230 235 240

Arg Arg Glu Ala Asn Gly Glu Ser Pro Lys Ser Ala Leu Ala Thr Glu
245 250 255

Val Ile Tyr Gly Asn Asn His Gly Lys Gln Ser Leu Asp Lys Thr Tyr
260 265 270

Leu Ala Ala Ala Leu Gly Thr Gly Lys Val Thr Ile Glu Thr Leu His
275 280 285

Gln Val Arg Ala Ile His Gln Gln Pro Asp Gly Ser Tyr Val Leu Ser
290 295 300

Val Asp Gln Ile Asp Thr Ala Gly Gln Thr Val Ala His Lys Glu Ile
305 310 315 320

Ser Cys Arg His Leu Phe Leu Gly Ala Gly Ser Leu Gly Ser Thr Glu
325 330 335

Leu Leu Val Arg Ala Arg Asp Thr Gly Ala Leu Pro Asp Leu Asn Ala
340 345 350

Glu Val Gly Ala Gly Trp Gly Pro Asn Gly Asn Ile Met Thr Gly Arg
355 360 365

Ala Asn His Val Trp Asn Pro Thr Gly Ala His Gln Ser Ser Ile Pro
370 375 380

Ala Leu Gly Ile Asp Asp Trp Asn Asn Pro Thr Ala Pro Val Phe Ala
385 390 395 400

Glu Ile Ala Pro Met Pro Ala Gly Leu Glu Thr Trp Val Ser Leu Tyr
405 410 415

Leu Ala Ile Thr Lys Asn Pro Glu Arg Gly Thr Phe Val Tyr Asp Lys
420 425 430

Ala Thr Asp Arg Ala Ala Leu Arg Trp Thr Arg Asp Gln Asn Thr Pro
435 440 445

Ala Val Asn Ala Ala Arg Ser Leu Phe Asp Arg Ile Asn Lys Ala Asn
450 455 460

Gly Thr Met Tyr Arg Tyr Asp Leu Phe Gly Pro Gln Leu Lys Asn Phe
465 470 475 480

Ser Asp Asp Phe Cys Tyr His Pro Leu Gly Gly Cys Val Leu Gly Lys
485 490 495

Ala Thr Asp Gly Tyr Gly Arg Val Ala Gly Tyr His Asn Leu Tyr Val
500 505 510

Thr Asp Gly Ala Leu Ile Pro Gly Ser Ile Gly Val Asn Pro Phe Val
515 520 525

Thr Ile Thr Ala Leu Ala Glu Arg Asn Ile Glu Arg Ile Ile Ala Glu
530 535 540 

Asp Val Lys Ala Ala 
545 550 



<210> 7 
<211> 1197 
<212> DNA 

<213> Streptomyces natalensis

<220> 

<221> CDS 

<222> (1)..(1197)

<223> ORF2

<400> 7

atg acg tac aca gac ccg gcc gcg ccc gag acg gat ccg ccg gcc gtc 48
Met Thr Tyr Thr Asp Pro Ala Ala Pro Glu Thr Asp Pro Pro Ala Val
1 5 10 15

gac ttt ccg cag cgc aag ccc ggc gtg ccg ttc ccg ccg ccc gac tac 96
Asp Phe Pro Gln Arg Lys Pro Gly Val Pro Phe Pro Pro Pro Asp Tyr
20 25 30

gcc gac tac cgc gac cgg aag ggg ctc gtc ctc tcg cag ctg tcc gac 144
Ala Asp Tyr Arg Asp Arg Lys Gly Leu Val Leu Ser Gln Leu Ser Asp
35 40 45

ggc aaa cgg gta tgg ctg gtc acc cgg cac gag gac gta cgc gcc gta 192
Gly Lys Arg Val Trp Leu Val Thr Arg His Glu Asp Val Arg Ala Val
50 55 60

ctg acc agc ccg agc atc agc tcg aac ccc gag cac aag gga ttt ccc 240
Leu Thr Ser Pro Ser Ile Ser Ser Asn Pro Glu His Lys Gly Phe Pro
65 70 75 80

aac gtc ggg aac ctg ggt gtg ccc aag cag gac cag atc ccg ggc tgg 288
Asn Val Gly Asn Leu Gly Val Pro Lys Gln Asp Gln Ile Pro Gly Trp
85 90 95

ttc gtg ggc atg gac tcc ccc gag cac gac cgg ttc cgc aag gcc ctc 336
Phe Val Gly Met Asp Ser Pro Glu His Asp Arg Phe Arg Lys Ala Leu
100 105 110

atc ccg gag ttc acc gtc cgg cgg gta cgc gcg atg aag ccc gcg atc 384
Ile Pro Glu Phe Thr Val Arg Arg Val Arg Ala Met Lys Pro Ala Ile
115 120 125

gaa cgc acg gtg gac gcc caa ctg gac gcc atg ctg gcc gcg ggc aac 432
Glu Arg Thr Val Asp Ala Gln Leu Asp Ala Met Leu Ala Ala Gly Asn
130 135 140

acc gcc gac ctc gtc gcc gac ttc gcc ctg ccc atc ccc tcc ctg gtg 480
Thr Ala Asp Leu Val Ala Asp Phe Ala Leu Pro Ile Pro Ser Leu Val
145 150 155 160

atc tcc gca ctg ctc ggc gtg ccg ccc gcc gac cgc gag ttc ttc gag 528
Ile Ser Ala Leu Leu Gly Val Pro Pro Ala Asp Arg Glu Phe Phe Glu
165 170 175

tcc agg acc cgc gtc ctg gtc tcc ctc cgc tcc tcc acc gac gac gac 576
Ser Arg Thr Arg Val Leu Val Ser Leu Arg Ser Ser Thr Asp Asp Asp
180 185 190

cgg atg gcc gcc gcc aag gac ctc ctg cgg tac atc aac cgg ctc gtg 624
Arg Met Ala Ala Ala Lys Asp Leu Leu Arg Tyr Ile Asn Arg Leu Val
195 200 205

gag atc aaa cag aag tgg ggc ggc gac gac ctc atc acc cgg ctg ctg 672
Glu Ile Lys Gln Lys Trp Gly Gly Asp Asp Leu Ile Thr Arg Leu Leu
210 215 220

gcc acc ggt gcc atc gcc ccc cac gaa atg tcc ggc gtg ctg atg ctc 720
Ala Thr Gly Ala Ile Ala Pro His Glu Met Ser Gly Val Leu Met Leu
225 230 235 240

ctg ctc atc gcc ggc cac gag acc acg gcc aac aac atc gcc ctc ggc 768
Leu Leu Ile Ala Gly His Glu Thr Thr Ala Asn Asn Ile Ala Leu Gly
245 250 255

gtg gtc acc ctg ctg gcg aac ccc caa tgg atc ggc gac gac cgg gcc 816
Val Val Thr Leu Leu Ala Asn Pro Gln Trp Ile Gly Asp Asp Arg Ala
260 265 270

gtg gag gag acc ctg cgc ttc cac tcc gtc gcc gac ctg gtg tcc ctg 864
Val Glu Glu Thr Leu Arg Phe His Ser Val Ala Asp Leu Val Ser Leu
275 280 285

cgc gtc gcg gtc cag gac gtg gaa atc gcc ggg cag ctc atc aag gcg 912
Arg Val Ala Val Gln Asp Val Glu Ile Ala Gly Gln Leu Ile Lys Ala
290 295 300

ggc gag gga atc gtg ccg ctg gtc gcc gcc gcc aat cat gac gag aac 960
Gly Glu Gly Ile Val Pro Leu Val Ala Ala Ala Asn His Asp Glu Asn
305 310 315 320

gcc ttc gaa tgc ccc cac gcc ttc gac ccg tcc cgg tcc gcc cgc cac 1008
Ala Phe Glu Cys Pro His Ala Phe Asp Pro Ser Arg Ser Ala Arg His
325 330 335

cat gtg gcc ttc ggc tac ggc gta cac caa tgc ctg gga cag aac ctg 1056
His Val Ala Phe Gly Tyr Gly Val His Gln Cys Leu Gly Gln Asn Leu
340 345 350

gtg cgg atc gag atg gaa gtc gcg tac cgg aaa ctc ttc gag cgc atc 1104
Val Arg Ile Glu Met Glu Val Ala Tyr Arg Lys Leu Phe Glu Arg Ile
355 360 365

ccg aac ctc gaa ctc gcc gtg ccc acc gac ggg ttg gac atc aag tac 1152
Pro Asn Leu Glu Leu Ala Val Pro Thr Asp Gly Leu Asp Ile Lys Tyr
370 375 380

gac ggc gtg ctc tac ggc ctg aac gag ctg ccc gtc cgc tgg tag 1197
Asp Gly Val Leu Tyr Gly Leu Asn Glu Leu Pro Val Arg Trp
385 390 395



<210> 8 



<211> 399

<212> PRT 

<213> Streptomyces natalensis
<400> 8 

Met Thr Tyr Thr Asp Pro Ala Ala Pro Glu Thr Asp Pro Pro Ala Val 
1 5 10 15

Asp Phe Pro Gln Arg Lys Pro Gly Val Pro Phe Pro Pro Pro Asp Tyr
20 25 30

Ala Asp Tyr Arg Asp Arg Lys Gly Leu Val Leu Ser Gln Leu Ser Asp
35 40 45 

Gly Lys Arg Val Trp Leu Val Thr Arg His Glu Asp Val Arg Ala Val 
50 55 60 

Leu Thr Ser Pro Ser Ile Ser Ser Asn Pro Glu His Lys Gly Phe Pro
65 70 75 80

Asn Val Gly Asn Leu Gly Val Pro Lys Gln Asp Gln Ile Pro Gly Trp
85 90 95

Phe Val Gly Met Asp Ser Pro Glu His Asp Arg Phe Arg Lys Ala Leu
100 105 110

Ile Pro Glu Phe Thr Val Arg Arg Val Arg Ala Met Lys Pro Ala Ile
115 120 125

Glu Arg Thr Val Asp Ala Gln Leu Asp Ala Met Leu Ala Ala Gly Asn
130 135 140

Thr Ala Asp Leu Val Ala Asp Phe Ala Leu Pro Ile Pro Ser Leu Val
145 150 155 160

Ile Ser Ala Leu Leu Gly Val Pro Pro Ala Asp Arg Glu Phe Phe Glu
165 170 175

Ser Arg Thr Arg Val Leu Val Ser Leu Arg Ser Ser Thr Asp Asp Asp
180 185 190

Arg Met Ala Ala Ala Lys Asp Leu Leu Arg Tyr Ile Asn Arg Leu Val
195 200 205

Glu Ile Lys Gln Lys Trp Gly Gly Asp Asp Leu Ile Thr Arg Leu Leu
210 215 220

Ala Thr Gly Ala Ile Ala Pro His Glu Met Ser Gly Val Leu Met Leu
225 230 235 240

Leu Leu Ile Ala Gly His Glu Thr Thr Ala Asn Asn Ile Ala Leu Gly
245 250 255

Val Val Thr Leu Leu Ala Asn Pro Gln Trp Ile Gly Asp Asp Arg Ala
260 265 270

Val Glu Glu Thr Leu Arg Phe His Ser Val Ala Asp Leu Val Ser Leu
275 280 285

Arg Val Ala Val Gln Asp Val Glu Ile Ala Gly Gln Leu Ile Lys Ala
290 295 300

Gly Glu Gly Ile Val Pro Leu Val Ala Ala Ala Asn His Asp Glu Asn
305 310 315 320

Ala Phe Glu Cys Pro His Ala Phe Asp Pro Ser Arg Ser Ala Arg His
325 330 335

His Val Ala Phe Gly Tyr Gly Val His Gln Cys Leu Gly Gln Asn Leu
340 345 350

Val Arg Ile Glu Met Glu Val Ala Tyr Arg Lys Leu Phe Glu Arg Ile
355 360 365

Pro Asn Leu Glu Leu Ala Val Pro Thr Asp Gly Leu Asp Ile Lys Tyr
370 375 380 

Asp Gly Val Leu Tyr Gly Leu Asn Glu Leu Pro Val Arg Trp 
385 390 395 



<210> 9 
<211> 1194 
<212> DNA 

<213> Streptomyces natalensis 

<220> 

<221> CDS 

<222> (1)..(1194)

<223> ORF3



<400> 9 

atg acc gcc gcc tcc cac gac ctg ccc tgc ctc aac ctc gaa ccg ccc 48
Met Thr Ala Ala Ser His Asp Leu Pro Cys Leu Asn Leu Glu Pro Pro
1 5 10 15

aaa atg ctg aaa ctg agc ccg ctg ctg cgc gcc ttg cag gac cgg ggg 96
Lys Met Leu Lys Leu Ser Pro Leu Leu Arg Ala Leu Gln Asp Arg Gly
20 25 30

ccg atc cac cgg gtg cgc aca ccc gcc ggg gac gag gcg tgg ctg gtg 144
Pro Ile His Arg Val Arg Thr Pro Ala Gly Asp Glu Ala Trp Leu Val
35 40 45

acc cgc cac gcc gag ctc aag cag ctg ctg cac gac gag cgc atc ggc 192
Thr Arg His Ala Glu Leu Lys Gln Leu Leu His Asp Glu Arg Ile Gly
50 55 60

cgc acg cac ccc gac ccg ccc tcc gcc gcc cag tac gta cgc agc ccc 240
Arg Thr His Pro Asp Pro Pro Ser Ala Ala Gln Tyr Val Arg Ser Pro
65 70 75 80

ttc ctg gac ctg ctg atc agc gac gcc gac gcc gag tcc ggg cgt cgg 288
Phe Leu Asp Leu Leu Ile Ser Asp Ala Asp Ala Glu Ser Gly Arg Arg
85 90 95

cag cac gcc gag acc cgc cgc ctg ctc act ccg ttg ttc tcg gcc cgg 336
Gln His Ala Glu Thr Arg Arg Leu Leu Thr Pro Leu Phe Ser Ala Arg
100 105 110

cgc gtt ctg gaa atg cag ccg aag gtg gag gag gcc gcg gac acc ctg 384
Arg Val Leu Glu Met Gln Pro Lys Val Glu Glu Ala Ala Asp Thr Leu
115 120 125

ctg gac gcg ttc atc gcc cag ggg cct ccc ggc gac ctg cac ggc gag 432
Leu Asp Ala Phe Ile Ala Gln Gly Pro Pro Gly Asp Leu His Gly Glu
130 135 140

ctc acc gtg ccg ttc gcc ctc acg gtc ctc tgc gag gtc atc ggc gtg 480
Leu Thr Val Pro Phe Ala Leu Thr Val Leu Cys Glu Val Ile Gly Val
145 150 155 160

ccg ccg cag cgc cgc gcg gag ctg acc aca ctg ctg gcc ggt atc gcc 528
Pro Pro Gln Arg Arg Ala Glu Leu Thr Thr Leu Leu Ala Gly Ile Ala
165 170 175

aag ctg gac gac cgc gag ggc gcc gta cgg gca cag gac gac ctg ttc 576
Lys Leu Asp Asp Arg Glu Gly Ala Val Arg Ala Gln Asp Asp Leu Phe
180 185 190

ggg tac gtg gca ggg ctg gtc gag cac aag cgg gcc gag ccc ggc cca 624
Gly Tyr Val Ala Gly Leu Val Glu His Lys Arg Ala Glu Pro Gly Pro
195 200 205

gac atc atc tcc cgg ctg aac gac ggc gag ctg acc gag gac cgc gtg 672
Asp Ile Ile Ser Arg Leu Asn Asp Gly Glu Leu Thr Glu Asp Arg Val
210 215 220

gca cac ctg gcc atg ggc ctg ctg ttc gcc ggg ctg gac agc gtc gcg 720
Ala His Leu Ala Met Gly Leu Leu Phe Ala Gly Leu Asp Ser Val Ala
225 230 235 240

agc atc atg gac aac ggg gtg gtg ctg ctg gcc gcc cac ccc gat cag 768
Ser Ile Met Asp Asn Gly Val Val Leu Leu Ala Ala His Pro Asp Gln
245 250 255

cgc gcg gcg gcg ctg gcc gac ccc gac gtg atg gcg cgt gcc gtg gag 816
Arg Ala Ala Ala Leu Ala Asp Pro Asp Val Met Ala Arg Ala Val Glu
260 265 270

gag gtg ctg cgg acc gcc cgg gcc ggc ggg tcg gtc ctg ccg ccg cgc 864
Glu Val Leu Arg Thr Ala Arg Ala Gly Gly Ser Val Leu Pro Pro Arg
275 280 285

tac gcc agc gag gac atg gaa ttc ggc ggg gtg acg ata cgg gcc gga 912
Tyr Ala Ser Glu Asp Met Glu Phe Gly Gly Val Thr Ile Arg Ala Gly
290 295 300

gac ctg gtc ctg ttc gac ctc ggc ctg ccc aac ttc gac gag cgg gcg 960
Asp Leu Val Leu Phe Asp Leu Gly Leu Pro Asn Phe Asp Glu Arg Ala
305 310 315 320

ttc aca ggg ccg gag gaa ttc gac gcc gcc agg acc ccc aat ccc cat 1008
Phe Thr Gly Pro Glu Glu Phe Asp Ala Ala Arg Thr Pro Asn Pro His
325 330 335

ctg acc ttc ggc cac ggc atc tgg cac tgc atc ggc gcc ccc ctc gcg 1056
Leu Thr Phe Gly His Gly Ile Trp His Cys Ile Gly Ala Pro Leu Ala
340 345 350

cgc ctg gaa ctc agg acg atg ttc acc aag ctg ttc acc cgc ctg ccg 1104
Arg Leu Glu Leu Arg Thr Met Phe Thr Lys Leu Phe Thr Arg Leu Pro
355 360 365

gaa ctg cgc ccg gaa ctt ccg gtg gag caa ctg cgc ctg aag gag ggc 1152
Glu Leu Arg Pro Glu Leu Pro Val Glu Gln Leu Arg Leu Lys Glu Gly
370 375 380

cag ctg tcg ggc ggc ttc gcc gag ctc cgg gtg gtc tgg tag 1194
Gln Leu Ser Gly Gly Phe Ala Glu Leu Arg Val Val Trp
385 390 395



<210> 10 
<211> 398 



<212> PRT 

<213> Streptomyces natalensis
<400> 10 

Met Thr Ala Ala Ser His Asp Leu Pro Cys Leu Asn Leu Glu Pro Pro 
1 5 10 15 



Lys Met Leu Lys Leu Ser Pro Leu Leu Arg Ala Leu Gln Asp Arg Gly
20 25 30

Pro Ile His Arg Val Arg Thr Pro Ala Gly Asp Glu Ala Trp Leu Val
35 40 45

Thr Arg His Ala Glu Leu Lys Gln Leu Leu His Asp Glu Arg Ile Gly
50 55 60

Arg Thr His Pro Asp Pro Pro Ser Ala Ala Gln Tyr Val Arg Ser Pro
65 70 75 80

Phe Leu Asp Leu Leu Ile Ser Asp Ala Asp Ala Glu Ser Gly Arg Arg
85 90 95

Gln His Ala Glu Thr Arg Arg Leu Leu Thr Pro Leu Phe Ser Ala Arg
100 105 110

Arg Val Leu Glu Met Gln Pro Lys Val Glu Glu Ala Ala Asp Thr Leu
115 120 125

Leu Asp Ala Phe Ile Ala Gln Gly Pro Pro Gly Asp Leu His Gly Glu
130 135 140

Leu Thr Val Pro Phe Ala Leu Thr Val Leu Cys Glu Val Ile Gly Val
145 150 155 160

Pro Pro Gln Arg Arg Ala Glu Leu Thr Thr Leu Leu Ala Gly Ile Ala
165 170 175

Lys Leu Asp Asp Arg Glu Gly Ala Val Arg Ala Gln Asp Asp Leu Phe
180 185 190

Gly Tyr Val Ala Gly Leu Val Glu His Lys Arg Ala Glu Pro Gly Pro
195 200 205

Asp Ile Ile Ser Arg Leu Asn Asp Gly Glu Leu Thr Glu Asp Arg Val
210 215 220

Ala His Leu Ala Met Gly Leu Leu Phe Ala Gly Leu Asp Ser Val Ala
225 230 235 240

Ser Ile Met Asp Asn Gly Val Val Leu Leu Ala Ala His Pro Asp Gln
245 250 255

Arg Ala Ala Ala Leu Ala Asp Pro Asp Val Met Ala Arg Ala Val Glu
260 265 270

Glu Val Leu Arg Thr Ala Arg Ala Gly Gly Ser Val Leu Pro Pro Arg
275 280 285 

Tyr Ala Ser Glu Asp Met Glu Phe Gly Gly Val Thr Ile Arg Ala Gly
290 295 300

Asp Leu Val Leu Phe Asp Leu Gly Leu Pro Asn Phe Asp Glu Arg Ala
305 310 315 320

Phe Thr Gly Pro Glu Glu Phe Asp Ala Ala Arg Thr Pro Asn Pro His
325 330 335

Leu Thr Phe Gly His Gly Ile Trp His Cys Ile Gly Ala Pro Leu Ala
340 345 350

Arg Leu Glu Leu Arg Thr Met Phe Thr Lys Leu Phe Thr Arg Leu Pro
355 360 365

Glu Leu Arg Pro Glu Leu Pro Val Glu Gln Leu Arg Leu Lys Glu Gly
370 375 380

Gln Leu Ser Gly Gly Phe Ala Glu Leu Arg Val Val Trp
385 390 395



<210> 11 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Forward primer
ermE promoter

<400> 11

aaactgcagc tctagaggcg gcttgcgccc gatgctagtc 40



<210> 12 
<211> 70 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Reverse primer 
ermE promoter 

<400> 12 

aaactgcagc tctagatggc ccgggtatcg atcgtcgacg gcatgcggat cctaccaacc 60
ggcacgattg 70

Claims

1. A nucleotide sequence comprising SEQ ID NO. 5,
SEQ ID NO. 7 or SEQ ID NO. 9, and homologues or fragments
thereof.

2. A nucleotide sequence as depicted by SEQ ID
NO. 5, SEQ ID NO. 7 or SEQ ID NO. 9, and homologues thereof.

3. An enzyme obtainable by expressing a
nucleotide sequence according to claims 1-2 in a micro-organism
chosen from the group of Streptomyces species and
heterologous species.

4. An enzyme comprising the amino acid sequence
as depicted by SEQ ID NO. 6, SEQ ID NO. 8, or SEQ ID NO. 10,
and homologues and fragments thereof.

5. A method for overexpressing a gene encoding an
enzyme according to claim 3 or 4 in Streptomyces by attaching
a promoter sequence to the gene, transferring the promoter-gene
complex into a cell and bringing the gene to expression.

6. A method for knocking out a gene encoding an
enzyme according to claim 3 or 4 in Streptomyces by
disrupting the coding sequence of the gene.

7. A method for expressing a gene encoding an
enzyme according to claim 3 or 4 in a heterologous micro-organism
by attaching a promoter sequence to the gene,
transferring the promoter-gene complex into a cell and
bringing the gene to expression.

8. A recombinant microorganism strain obtainable
by a method according to claims 5-7.

9. A method for preparing pimaricin involving the
use of a recombinant Streptomyces obtainable by a method
according to claim 5.

10. A method according to claim 9, wherein
Streptomyces is Streptomyces natalensis.

11. A method for preparing a biomolecule of
interest involving the use of a recombinant Streptomyces
obtainable by a method according to claim 6 or 7.

12. A biomolecule of interest obtainable by a
method according to claim 11.



EP-2959 
GENES ENCODING ENZYMES IN THE BIOSYNTHESIS OF PIMARICIN AND
THE APPLICATION THEREOF

ABSTRACT 

The invention relates to genes encoding enzymes which 
are fundamental in the biosynthesis of pimaricin, and to the 
enzymes which are encoded by said genes. The invention 
further relates to the application of said genes for modifying
the biosynthesis of pimaricin, as well as for the
biosynthesis of new compounds.